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MANIPULATION OF ORGANIC ACID BIOSYNTHESIS AND SECRETION 2 

The present Invention relates to nucleic add fragments encoding amino 
acid sequences for organic acid biosynthetic enzymes in plants, and the use 
thereof for the modification of organic acid biosynthesis and secretion in plants. 

Documents cited in this specification are for reference purposes only and 
their inclusion is not acknowledgment that they fomi part of the common general 
knowledge in the relevant art. 

Oiganic acids, such as citrate and malate. are key metabolites in plants. 
They are involved in numerous processes, including 04 and Crassulacean acid 
metabolism (CAM) photosynthesis, stomatal and pulvlnual movement, nutrient 
uptake, respiration, nitrogen assimilation, fatty acid oxidation, and providing 
energy to bacteroids In root nodules. For example, malate plays a key role in root 
nodule metabolism and nitrogen fixation, serving as ttie primary carbon source for 
bacterold maintenance and nitrogenase activity, and is also tightiy linked to nodule 
nitrogen assimilation. Furttiermore. the complexing role of organic acids produced 
and excreted ftbrn plant roots has also been associated with tolerance to the 
aluminium cation Al^* which is toxic to many plants at micromolar concentrations. 
Aluminium toxicity has been recognized as a major limiting factor of plant 
productivity on acidic soils, which account for approximately 40% of tiie earth's 
arable land. 

The tricarboxylic acid cycle (TCA). also known as Krebs cycle (after its 
discoverer Hans Krebs) or citric acid cycle, moves electrons from organic acids to 
the oxidized redox cofactors NAD* and FAD. forming NADH. FADHa. and carbon 
dioxide (CO2). The reaction sequence of the TCA cycle involves: in a reaction 
catalyzed by citrate synttiase (CS). acetyl-CoA fomned by the pymvate 
dehydrogenase complex combines wfth oxaloacetate to produce the Cs 
tricarboxylic add. citrate. In tiie overall cycle, tiie citrate is oxidized to produce two 
molecules of CO2 in a series of reactions ttiat leads to the fonmation of one 
oxaloacetate. three NADH. one FADH2. and one ATP. The resulting oxaloacetate 
) reacts with anotiier molecule of acetyl-CoA to continue tiie cyde. The oxidative 
decarboxylation of pymvate yields an additional CO2 and NADH. Thus the TCA 



cycle brings about the complete oxidation of pyruvate to three CO2 plus 
electrons, which are stored temporarily as 4 NADH and 1 FADH2. 



Cytosolic reactions generate products that are transported into the 
mitochondria to feed the TCA cycle. The nature of the end product of the glycolytic 

5 reactions In the cytosol of plants Is determined by the relative activities of the three 
enzymes that can utilize phosphoenol-pymvate (PEP) as substrate. Both pyaivate 
kinase and PEP-phosphatase fbnn pyruvate; while PEP-carboxylase (PEPC) 
generates oxaloacetate. Pymvate is transported directly into the mitochondrion. 
Oxaloacetate is either transported directly into the mitochondrion or first reduced 

10 to malate by cytosolic malate dehydrogenase (MDH). 

Before entering the TCA cycle proper, pymvate is oxidised and 
decarboxylated by the pyaivate dehydrogenase enzyme complex to form CO2. 
acetyl-CoA, and NADH. The pymvate dehydrogenase enzyme complex, which 
requires the bound cofactors thiamine pyrophosphate, lipoic acid, and FAD as well 
1 6 as free coenzyme A (CoASH) and NAD*, links the TCA cycle to glycolysis. 

It is known that the TCA cycle includes the following enzymes: pyruvate 
dehydrogenase, citrate synthase, citrate hydrolase, isocitrate dehydrogenase, 
oxoglutarate dehydrogenase. succinyl-CoA synthetase, succinate dehydrogenase, 
fumarase, malate dehydrogenase. NAD-malic enzyme and phosphoenolpyruvate 
20 carboxylase. 

In particular, citrate synthase (CS) catalyzes the condensation of acetyl- 
CoA and oxaloacetate to fonn the C6 molecule citrate and free CoASH. as the 
TCA cycle proper begins. 

Malate dehydrogenase (MDH) catalyzes the final step of the TCA cycle, 
25 oxidizing malate to oxaloacetate and producing NADH. This reaction catalyzed by 
MDH is reversible, thus allowing also for the reversible reduction of oxaloacetate 
to malate. The enzyme MDH is important in several metabolic pathways, and 
higher plants contain multiple fonms that differ in co-enzyme specificity and 
subcellular localization. Chloroplasts contain an NADP*-dependent MDH that 



plays a critical role in balancing reducing equivalents between the cytosol and 
stroma. Plants also contain NAD-dependent MDHs which are found in a) 
mitochondria as part of the TCA cycle; b) cytosol and peroxisomes involved in 
malate-aspartbte shuttles; and c) glyoxisomes functioning in p-oxidation. In root 
nodules of nitrogen-fixing legumes, such as white clover {Trifolium repens) and 
alfalfa {Medicago sativa), malate serves as the primary carbon source to support 
the respiratory needs of the bacterial microsymbiont and the fixation of Nz by 
nitrogenase. and a nodule-enhanced MDH is thus critical for nodule function. 

Phosphoenolpyruvate carboxylase (PEPC) catalyzes the reaction of 
phosphoenol-pynjvate with HCO3 releasing the phosphate and producing the C4 
product, oxaloacetate. Oxaloacetate is commonly reduced to malate by NADH 
through the action of malata dehydrogenase (MDH). PEPC is a homotetrameric 
enzyme widely distributed in most plant tissues. In plants. PEPC fulfils various 
physiological roles such as the photosynthetic CO2 fixation in C4 and 
Ciassulaceari Acid Metabolism (CAM) plants, and the anaplerotic pathway. 

While nucleic acid sequences encoding some organic acid biosynthetic 
enzymes have been isolated for certain species of plants, there remains a need for 
materials useful in modifying organic acid biosynthesis; in modifying organic acid 
secretion: in modifying phosphorous acquisition efficiency in plants; in modifying 
aluminium and add soil tolerance in plants; in modifying nttrogen fixation - and 
nodule function, particularty In forage legumes and grasses, including alfalfa, 
medics, clovers, ryegrasses and fescues, and for methods for their use. 

This Invention Is directed towards overcoming, or at least alleviating, one or 
more of the difficulties or deficiencies associated with the prior art. 

5 In one aspect, the present invention provides substantially purified or 

isolated nucleic acids or nucleic acid fragments encoding the organic add 
biosynthetic enzymes CS. MDH and PEPC. from a clover [Trifolium). medic 
{Medicago), ryegrass {Lolium) or fescue (Festuca) spedes. or functionally active 
fragments or variants thereof. 



The present invention also provides substantially purified or isolated nucleic 
acids or nucleic acid fragments encoding amino acid sequences for a class of 

. proteins from a clover {Trifolium), medic {Medicago), ryegrass {Lolium) or fescue 
{Festuca) species which are related to CS. MDH and PEPC. or functionally active 

5 fragments or variants thereof. Such proteins are refenred to herein as CS-llke. 
MDH-like and PEPC-IIke respectively. 

The present invention also relates to individual or simultaneous 
enhancement or otherwise manipulation of CS. MDH and/or PEPC or like gene 
activities in plants to enhance or otherwise alter organic acid biosynthesis; to 
1 0 enhance or reduce or othenwise alter organic acid secretion; to enhance or reduce 
or otherwise aHer phosphorous acquisition efficiency In plants; to enhance or 
reduce or othenwise alter aluminium and acid soil tolerance in plants; and/or to 
enhance or reduce or othenvise alter nitrogen fixation and nodule functton In 
legumes. 

5 The individual or simultaneous enhancement or otherv^^ise manipulation of 

CS. MDH and/or PEPC or like gene activities in plants has significant 
consequences for a range of applications in. for example, plant production, plant 
perfomiance, plant .nutrition and plant tolerance. For example, it has applications 
in Increasing plant tolerance to aluminium-toxic acid soils; in improving plant 
20 nutrient acquisition efficiency for example In Increasing acquisition of phosphorus 
from soils; In increasing nodule functton In nitrogen-fixing legumes for example 
leading to enhanced nitrogen fixation; in modifying the accumulation of organic 
acids such as citrate in fruits; in modifying the secretion of organic acids for 
example citrate and/or malate from plant roots. 

25 Manipulation of CS. MDH and/or PEPC or like gene activrtles In plants. 

Including legumes such as clovers {Trifolium species), lucerne (Medicago sativa) 
and grass species such as ryegrasses {Lolium species) and fescues {Festuca 
species) may be used to facilitate the production of. for example, forage legumes 
and forage grasses and other crops with enhanced tolerance to aluminium toxic 

30 soils; enhanced nutrient acquisition efficiency; forage legumes with enhanced 



nitrogen fixation; fruits witli enlianced organic acid content leading to 
flavour and health benefits. 



The clover {Jrffolium), medic {Medicago), ryegrass {Lolium) or fescue 
iFestuca) species may be of any suitable type, including white clover {Trifolium 

5 mpens). red clover (Trifolium pmtense), subten^nean clover {Trifolium 
subterraneum), alfalfa (Medicago sativa). Italian or annual ryegrass {Lolium 
mumorum), perennial ryegrass {Lolium perenne). tall fescue {Festuca 
arundinacea). meadow fescue {Festuca pratensis) and red fescue {Festuca rubra). 
Preferably the species is a clover or a ryegrass, more preferably white dover (7. 

10 repens) or perennial ryegrass (L. perenne). White clover {Trifolium repens L.) and 
perennial ryegrass {Lolium perenne L.) are Icey pasture legumes and grasses, 
respectively, in temperate climates throughout the worid. Perennial ryegrass is 
also an important turf grass. 

The nucleic acid or nucleic acid fragment may be of any suitable type and 
1 5 includes DNA (such as cDN A or genomic DN A) and RNA (such as mRNA) that is 
single- or double-stranded, optionally containing synthetic, non-natural or altered 
nucleotide bases,, and combinations thereof. 

The term "isolated" means that the material is removed from its original 
environment (eg. the natural environment if it is naturally occurring). For example, 

20 a naturally occurring nucleic acid or polypeptide present in a living plant is not 
isolated, but the same nucleic acid or polypeptide separated from some or all of 
the coexisting materials in the natural system, is isolated. Such an isolated nucleic 
add could be part of a vector and/or such a nudeic acid could be part of a 
composition, and still be isolated in that such a vector or composition is not part of 

25 its natural environment An isolated polypeptide could be part of a composition and 
still be isolated in that such a composition Is not part of its natural environment. 

The term "purified" means that the nudeic add or polypeptide is 
substantially free of other nudeic acids or polypeptides. 



6 



By "functionally active" in respect of a nucleotide sequence it is meant that 
the fragment or variant is capable of modifying organic acid biosynthesis in a plant. 
A variant in this context can be an analogue, derivative or mutant and includes 
naturally occurring allelic variants and non-naturally occurring variants. Additions, 
deletions, substitutions and derivatlzatlons of one or more of the nucleotides are 
contemplated so long as the modifications do not result in loss of functional activity 
of the fragment or variant. Preferably the functionally active fragment or variant 
has at least approximately 80% identity to the functional part of the above 
mentioned sequence, more preferably at least approximately 90% Identity, most 
preferably at least approximately 95% identity. Such ilinctionally active variants 
and fragments include, for example, those having nucleic acid changes which 
result in conservative amino acid substitutions of one or more residues in the 
corresponding amino acid sequence. Preferably the fragment has a size of at least 
30 nucleotides, more preferably at least 45 nucleotides, most preferably at least 60 
nucleotides. 

By "functionally active" in the context of a polypeptide it is meant that the 
fragment or variant has one or more of the biological properties of the proteins CS, 
CS-like. MDH, MDH-IIke, PEPC and PEPC-like. A variant In this context includes 
additions, deletions, substitutions and derivatizations of one or more of the amino 
acids are contemplated so Ibng as the nradifications do not result in loss of 
functional activity of the firagment or variant. Preferably the functionally active 
fragment or variant has at least approximately 60% identity to the functional part of 
the above mentioned sequence, more preferably at least approximately 80% 
identity, most preferably at least appnbximately 90% identity. Such functionally 
active variants and fragments Include, for example, those having conservative 
amino acid substitutions of one or more residues in the corresponding amino acid 
sequence. Preferably the fragment has a size of at least 10 amino acids, more 
preferably at least 15 amino acids, most preferably at least 20 amino acids. 

The term "construct" as used herein refers to an artificially assembled or 
isolated nucleic acid molecule which includes the gene or genes of interest. In 
general a constmct may Include the gene or genes of interest, a maricer gene 
which In some cases can also be Vne gene of interest and appropriate regulatory 



sequences. It should be appreciated that the Inclusion of regulatory sequences in 
a construct is optional, for example, such sequences may not be required in 
situations where the regulatory sequences of a host cell are to be used. The temi 
construct includes vectors but should not be seen as being limited thereto. 

5 The term "vector' as used herein encompasses both cloning and 

expression vectors. Vectors are often recombinant molecules containing nucleic 
acid molecules from several sources. 

By "operatlvely linked" in respect of one or more regulatory, elements, 
nucleic acids or nucleic acid fragments and temninators. Is meant that said 
10 regulatory element(s) is capable of causing expression of said nucleic acid(s) or 
nucleic acid fragment(s) in a plant cell and said termlnator(s) is capable of 
terminating expression of said nucleic acld(s) or nucleic acid fragment(s) in a plant 
cell. Preferably, said regulatory element(s) is upstream of said nucleic acld(s) or 
nucleic acid fragment{s) and said temiinator(s) is downstream of said nucleic 
. 1 5 acid(s) or nucleic acid fragment(s). 

In a particulariy preferred embodiment, each nudeic acid or nucleic acid 
fragment has one or more upstream promoters and one or more downstream 
terminators, although expression of more than one nucleic acid or nucleic acid 
fragment from an upstream regulatory element(s) or temrtination of more than one 
20 nucleic acid or nucleic acid fragment from a downstream tennlnator(s) is not 
precluded. 

By "an effective amount" of a nucleic acid or nucleic acid fragment it is 
meant an amount sufficient to result In an identifiable phenotypic trait in said plant, 
or a plant, plant seed or other plant part derived therefrom. Such amounts can be 

25 readily determined by an appropriately sl^llled person, taking into account the type 
of plant, the route of administration and other relevant factors. Such a person will 
readily be able to determine a suitable amount and method of administration. See. 
for example, Maniatis et al. Molecular Cloning: A Laboratory Manual. Cold Spring 
Harbor Laboratory. Cold Spring Harbor, the entire disclosure of which Is 

30 incorporated herein by reference. 
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It will also be uncieretood that the term "comprises" (or its grammatical 
variants) as used in this specification is equivalent to the temi "includes" and 
should not be taken as excluding the presence of other elements or features. 

Such nucleic acids or nucleic acid fragments could be assembled to fomn a 
5 consensus contig. As used herein, the tenn "consensus contig" refers to a 
nucleotide sequence that Is assembled from two or more constituent nucleotide 
sequences that share common or bveriapping regions of sequence homology. For 
example, the nucleotide sequence of two or more nucleic acids or nucleic acid 
fragments can be compared and aligned In order to identify common or 
10 overiapping sequences. Where common or overiapping sequences exist between 
two or more nucleic acids or nucleic acid fragments, the sequences (and thus their 
con-esponding nucleic acids or nucleic acid fragments) can be assembled into a 
single contiguous nucleotide sequence. 

In a preferred embodiment of this aspect of the invention, the substantially 
15 purified or isolated nucleic acid or nucleic acid fragment encoding an CS or CS- 
like protein includes a nucleotide sequence selected from the group consisting of 
(a).sequences shown In Figures 1. 3. 4. 6, 7, 9. 99, .101. 102 and 104 hereto; (b) 
complements of the sequences shown in Figures 1, 3, 4, 6, 7, 9. 99. 101. 102 and 
104 hereto; (c) sequences antisense to the sequences recited in (a) and (b); and 
20 (d) functionally active fragments and variants of the sequences recited in (a), (b) 
and (c). 

In a further preferred embodiment of this aspect of the invention, the 
substantially purified or isolated nucleic acid or nucleic acid fragment encoding a 
MDH or MDH-like protein includes a nucleotide sequence selected from the group 

25 consisting of (a) sequences shown in Figures 11, 13, 14. 16, 17, 19, 21, 23, 25, 
26, 28. 30. 31, 33. 35. 37, 38. 40, 50. 55. 57, 58, 60, 61. 63, 64, 66, 67, 69, 70, 72. 
73. 75. 76, 78. 79. 81. 82 and 84 hereto; (b) complements of the sequences 
shown in Figures 11. 13. 14, 16. 17. 19. 21. 23. 25. 26. 28. 30, 31. 33. 35. 37. 38. 
40. 50. 55, 57. 58. 60. 61. 63. 64. 66. 67. 69. 70. 72. 73, 75, 76, 78. 79, 81. 82 and 

30 84 hereto; (c) sequences antisense to the sequences recited in (a) and (b); and (d) 
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functionally active fragments and variants of the sequences recited in (a), (b) and 
(c). 

In a ftjfther prefen-ed embodiment of this aspect of the invention, the 
substantially purified or Isolated nucleic acid or nucleic acid fragment encoding an 

5 PEPC or PEPC-like protein includes a nucleotide sequence selected from the 
group consisting of (a) sequences shown in Figures 42. 44. 46. 47. 49. 51, 53, 86, 
88. 89, 91, 92, 94, 95 and 97 hereto; (b) complements of the sequences shown in 
Figures 42, 44, 46, 47, 49. 51. 53, 86, 88. 89, 91. 92. 94. 95 and 97 hereto; (c) 
sequences antisense to the sequences recited in (a) and (b); and (d) functionally 

1 0 active fragments and variants of the sequences recited in (a), (b) and (c). 

Genes encoding other CS or CS-like. MDH or MDH-like and PEPC or 
PEPC-like proteins, either as cDNAs or genomic DNAs. may be isolated directly 
by using all or a portion of the nucleic acids or nucleic acid fragments of the 
present invention as hybridisation probes to screen libraries from the desired plant 
15 employing the methodology well known to those skilled In the art. Specific 
oligonucleotide probes based upon the nucleic acid sequences of the present 
Invention may be designed and synthesized by methods known in the art. 
Moreover, the entire sequences may be used directly to synthesize DNA probes 
by methods known to the skilled artisan such as random primer DNA labelling. 

20 nick translatfon, or end-labelling techniques, or RNA probes using available in vitro 
transcription systems. In additfon. specific primers may be designed and used to 
amplify a part or all of the sequences of the present invention. The resulting 
amplification products may be labelled directly during iamplificatlon reactions or 
labelled after amplification reactions, and used as probes to isolate full-lengtti 

25 cDNA or genomic fragments under conditions of appropriate stringency. 

In addition, short segments of ttie nucleic acids or nucleic acW fragments of 
the present invention may be used in protocols to amplify longer nuclefc acids or 
nucleic acid fragments encoding homologous genes from DNA or RNA. For 
example, polymerase chain reaction may be perfomied on a library of cloned 
30 nucleic acid fragments wherein the sequence of one primer is derived from the 
nucleic acW sequences of the present invention, and the sequence of the ottier 
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primer takes advantage of the presence of the polyadenylic acid tracts to the 3' 
end of the mRNA precursor encoding plant genes. Alternatively, the second primer 
sequence may be based upon sequences derived from the cloning vector. For 
example, those skilled In the art can follow the RACE protocol (Frohman et at. 
5 (1988) Proc. A/atf. Acad Set. USA 85:8998. the entire disclosure of which is 
Incorporated herein by reference) to generate cDNAs by using PCR to amplify 
copies of the region between a single point In the transcript and the 3' or 5' end. 

Using commercially available 3' RACE and 5' RACE systems (BRL). specific 3' or 

5* cDNA fragments may be isolated (Ohara et al. (1989; Proc. Natl. Acad Sci USA 
10 86:5673; Loh et al. (1989) Science 243:217. the entire disclosures of which are 

incorporated herein by reference). Products generated by the 3* and 5* RACE 

procedures may be combined to generate full-length cDNAs. 

In a further aspect of the present invention there is provided a substantially 
purified or isolated polypeptide from a clover {Trifolium), medic (Medicago), 
15 ryegrass (Lolium) or fescue {Festuca) species, selected from the group consisting 
of CS or CS-like. MDH or MDH-like and PEPC or PEPC-like proteins; and 
functionally active fragments and variants thereof. 

The clover [Trifolium), medic {Medicago), ryegrass (Lolium) or fescue 
(Festuca) species may be of any suitable type, including white clover {Trifolium 

20 repensl red clover {Trifolium pretense), subten^nean clover {Trifolium 
subterraneum), alfalfa {Medicago sativa), Italian or annual ryegrass {Lolium 
multiflorum). perennial ryegrass {Lolium perenne), tall fescue {Festuca 
arundinacea), meadow fescue {Festuca pratensis) and red fescue {Festuca rubra). 
Preferably the species is a clover or a ryegrass, more preferably white clover {T. 

25 repens) or perennial ryegrass (L. perenrte). 

In a preferred embodiment of this aspect of the invention, the substantially 
purified or Isolated CS or CS-like polypeptide includes an amino acid sequence 
selected from the group consisting of the sequences shown In Figures 2. 5. 8, 10, 
100 and 103 hereto, and functionally active fragments and variants thereof. 
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In a further preferred embodiment of this aspect of the invention, the 
substantially purified or isolated MDH or MDH-like polypeptide includes an amino 
acid sequence selected from the group consisting of the sequences shown in 
Figures 12. 15, 18. 20. 22. 24. 27. 29. 32. 34. 36. 39. 41. 56. 59. 62, 65, 68. 71, 
5 74, 77. 80, 83 and 85 hereto, and functionally active fragments and variants 
thereof. 

In a further preferred embodiment of this aspect of the invention, the 
substantially purified or Isolated PEPC or PEPdike polypeptide includes an 
amino acid sequence selected from the group consisting of the sequences shown 
10 in Figures 43. 45. 48, 50, 52. 54. 87. 90. 93. 96 and 98 hereto, and functionally 
active fragments and variants thereof. 

In a further embodiment of this aspect of the invention, there Is provided a 
polypeptide recombinantly produced from a nucleic acid or nucleic acid fragment, 
acconjing to the present invention. Techniques for recombinantly producing 
1 5 polypeptides are known to those skilled in the art. 

Availability of the nucleotide sequences of the present invention and 
deduced amino acid sequences facilitates immunological screening of cDNA 
expression libraries. Synthetic peptides representing portions of the instant amino 
acid sequences, may be synthesized. These peptides may be used to immunise 
20 animals to produce polyclonal or monoclonal antibodies with specificity for 
peptides and/or proteins including the amino acid sequences. These antibodies 
may be then used to screen cDNA expression libraries to isolate full-length cDNA 
clones of Interest. 

A genotype Is the genetic constitution of an Individual or group. Variations in 
25 genotype are important In commercial breeding programs, in determining 
parentage. In diagnostics and fingerprinting, and the like. Genotypes can be 
readily described In terms of genetic mariners. A genetic maricer identifies a 
specific region or locus in the genome. The more genetic mariners, the finer 
defined is the genotype. A genetic marker becomes particulariy useful when it Is 
30 allelic between organisms because it then may serve to unambiguously identify an 



12 

individual. Furthermore, a genetic marker becomes particularly useful when it is 
based on nucleic acid sequence information that can unambiguously establish a 
genotype of an Individual and when the function encoded by such nucleic acid is 
known and is associated with a specific trait. Such nucleic acids and/or nucleotide 
5 sequence information including single nucleotide polymorphisms (SNPs). 
variations in single nucleotides between allelic forms of such nucleotide sequence, 
may be used as perfect markers or candidate genes for the given trait. 

Applicants have Identified a number of SNPs of the nucleic acids or nucleic 
acid fragments of the present invention. These are indicated (marked with grey on 
10 the t?lack background) in the figures that show multiple alignments of nucleotWe 
sequences of nucleic acid fragments contributing to consensus contig sequences. 
See for example. Figures 3. 6. 13. 16. 25. 30. 37. 46. 57. 60. 63. 66. 69. 72. 75. 
78. 81 . 88. 91, 94 101 and 104 hereto. 

Accordingly, in a further aspect of the present invention, there is provided a. 

15 substantially purified or isolated nucleic acid or nucleic acid fragment including a 
single nucleotide polymorphism (SNP) from a nucleic acid or nucleic acid fragment 
according to the present invention, for example a SNP from a nucleic acid 
sequence shown in Figures 3. 6. 13. 16. 25. 30. 37. 46. 57. 60. 63. 66. 69, 72. 75. 
78. 81. 88. 91. 94. 101 and 104 hereto; or complements or sequences antisense 

20 thereto, and functionally active fragments and variants thereof. The invention 
further provides a substantially purified or Isolated nucleic acid or nucleic acid 
fragment Including a single nucleotide polymorphism (SNP) isolated by the method 
of this Invention. 

In a still further aspect of the present Invention there is provided a method 
25 of Isolating a nucleic acid or nucleic acid fragment of the present invention 
including a SNP. said method including sequencing nucleic acid fragments from a 
nucleic acid library. The method includes the step of identifying the SNP. 



The nucleic acid library may be of any suitable type and Is preferably a 
cDNA library. 
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The nucleic acid or nucleic acid fragment may be isolated from a 
recombinant plasmid or may be amplified, for example using polymerase chain 
reaction. 

The sequencing may be perfonned by techniques known to those skilled in 
5 the art. 

In a still further aspect of the present Invention, there is provided use of the 
nucleic acids or nucleic acid fragments of the present invention including SNPs, 
and/or nucleotide sequence infomnation thereof, as molecular genetic markers. 

In a still further aspect of the present invention there is provided use of a 
10 nucleic acid or nucleic acid fragment of the present invention, and/or nucleotide 
sequence infonnation thereof, as a molecular genetic marker. 

More particulariy, nucleic acids or nucleic acid fragments according to the 
present invention and/or nucleotide sequence infonnation thereof may be used as 
a molecular genetic marker for quantitative trait loci (QTL) tagging. QTL mapping, 

15 DNA fingerprinting and in marker assisted selection, particularly in clovers, alfalfa, 
ryegrasses and fescues. Even more particulariy. nucleic acids or nucleic acid 
fragments according to the present invention and/or nucleotide sequence 
infonnation thereof may be used as molecular genetic markers in plant 
improvement In relation to plant tolerance to abiotic stresses such aluminium toxic 

20 acid soils; In relation to nutrient acquisition efficiency including phosphoms; in 
relation to nitrogen fixation; in relation to nodulation. Even more particulariy, 
sequence Infonnation revealing SNPs In allelic variants of ttie nucleic acids or 
nucleic acid fragments of the present Invention and/or nucleotide sequence 
information ttiereof may be used as molecular genetic markers for QTL tagging 

25 and mapping and in marker assisted selection, particulariy in clovers, alfalfa, 
ryegrasses and fescues. 

In a still further aspect of the present invention there is provided a 
construct Including one or more nucleic acids or nucleic acid fragments according 
to the present invention. 
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In a still further aspect of the present invention there is provided a vector 
including one or more nucleic acids or nucleic acid fragments according to the 
present invention. 

In a preferred embodiment of this aspect of the invention, the vector may 
5 Include one or several of the following: one or more regulatory elements such as 
promoters, one or more nucleic acids or nucleic acid fragments according to the 
present invention and one or more terminators; said one or more regulatory 
elements, one or more nucleic acids or nucleic acid fragments and one or more 
terminators being operatively linked. 

10 In a preferred embodiment of the present inyention the vector may contain 

nucleic acids or nucleic aqid fragments encoding both CS or CS-«ke and MDH or 
MDH-like polypeptides, operatively linked to a regulatory element or regulatory 
elements, such that both CS or CS-like and MDH or MDH-like proteins are 
expressed. 

15 In another preferred embodiment of the present invention the vector may 

contain nucleic acids or nucleic acid fragments encoding both CS or CS-like and 
PEPC or PEPC-like polypeptides, operatively linked to a regulatory element or 
regulatory elements, such that both CS or CS-like and PEPC or PEPC-like 
proteins are expressed. 

20 In yet another particulariy preferred embodiment of the present invention 

the vector may contain nucleic acids or nucleic acid fragments encoding both MDH 
or MDH-like and PEPC or PEPC-like polypeptides, operatively linked to a 
regulatory element or regulatory elements, such that both MDH or MDH-like and 
PEPC or PEPC-like proteins are expressed. 

25 In another particularly preferred embodiment of the present Invention the 

vector may contain nucleic acids or nucleic acid fragments encoding all three of 
CS or CS-like. MDH or MDH-like and PEPC or PEPC-like. operatively linked to a 
regulatory element or regulatory elements, such that all three of CS or CS-like, 
MDH or MDH-like and PEPC or PEPC-like proteins are expressed. 
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The vector may be of any suitable type and may be viral or non-viral. The 
vector may be an expression vector. Such vectors include chromosomal, non- 
chromosomal and synthetic nucleic acid sequences, eg. derivatives of plant 
viruses; bacterial plasmlds; derivatives of the Ti plasmid from Agrobacterium 
5 tumefaciens, derivatives of the Ri plasmid from Agrobacterium rhizogenes; phage 
DNA; yeast artificial chromosomes; bacterial artificial chromosomes; binary 
bacterial artificial chromosomes; vectors derived from combinations of plasmlds 
and phage DNA. However, any other vector may be used as long as it Is 
replicable. integrative or viable In the plant cell. 

1 0 The regulatory element and tenninator may be of any suitable type and may 

be endogenous to the target plant cell or may be exogenous, provided that they 
are functional in the target plant cell. 

Preferably the regulatory element is a promoter. A variety of promoters 
which may be employed in the vectors of the present Invention are well known to 

15 those skilled in the art. Factors influencing the choice of promoter include the 
desired tissue specificity of the vector, and whether constitutive or Inducible 
expression is desired and the nature of the plant cell to be transformed (eg. 
monocotyledon or dicotyledon). Particulariy suitable constitutive promoters include 
the Cauliflower Mosaic Virus 35S (CaMV 35S) promoter, the maize Ubiquitin 

20 promoter, and the rice Actin promoter. Particulariy suitable tissue specific 
promoters include root prevalent promoters. 

A variety of terminators which may be employed in the vectors of the 
present invention are also well known to those skilled in the art. The temninator 
may be from the same gene as the promoter sequence or a different gene. 
25 Particulariy suHable terminators are polyadenylation signals, such as the CaMV 
35S polyA and other temiinators from the nopaline synthase (nos) and the 
octopine synthase (ocs) genes. 

The vector, in addition to the regulatory element(s). the nucleic acid(s) or 
nucleic acid fragment(s) of the present invention and the tenninator(s). may 
30 Include further elements necessary for expression of the nucleic acid(s) or nucleic 
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acid fragment(s). in different combinations, for example vector backbone, origin of 
replication (on), multiple cloning sites, spacer sequences, enhancers, introns (sucli 
as the maize Ubiquitin Ubi intron), antibiotic resistance genes and other selectable 
marker genes [such as the neomycin phosphotransferase (npf2) gene, the 
5 hygramycin phosphotransferase (hph) gene. the phosphinothricin 
acetyltransferase (bar or paf) gene, the phospho-mannose isomerase (pmi) gene], 
and reporter genes (such as beta-glucuronidase (GUS) gene {gusA)]. The vector 
may also contain a ribosome binding site for translation Initiation. The vector may 
also Include appropriate sequences for amplilying expression. 

10 As an aftemative to use of a selectable marker gene to provide a 

phenotypic trait for selection of transformed host cells, the presence of the vector 
in transformed cells may be detemnined by other techniques well known in the art. 
such as PCR (polymerase chain reaction). Southern blot hybridisation analysis, 
histochemical GUS assays, northern and Western blot hybridisation analyses. 

15 Those skilled in the art will appreciate that the various components of the 

vector are operatively linked, so as to result in expression of said nucleic acld(s) or 
nucleic acid fragment(s). Techniques for operatively linking the components of the 
vector of the present invention are well known to those skilled in the art. Such 
techniques include the use of linkers, such as synthetic linkers, for example 

20 including one or more restriction enzyme sites. 

The constructs and vectors of the present invention may be incorporated 
into a variety of plants, including monocotyledons (such as grasses from the 
genera Lolium, Festuca, Paspalum. Pennisetum. Panicum and other forage and 
turfgrasses. com. oat. sugarcane, wheat and bariey). dicotyledons (such as 

25 arabidopsis, tobacco, clovers, medics, eucalyptus, potato, sugarbeet. canola, 
soybean, chickpea) and gymnospemns. In a preferred embodiment, the constructs 
and vectors may be used to transform monocotyledons, preferably grass species 
such as ryegrasses {Lolium species) and fescues (Festuca species), more 
preferably perennial ryegrass, including forage- and turf-type cultlvars. In an 

30 alternate preferred embodiment, the vectors may be used to transform 
dicotyledons, preferably forage legume species such as clovers {Trifolium species) 
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and medics {Medicago species), more preferably white clover (Trifolium repens), 
red clover {Trifolium pretense), subterranean clover (Trifolium subterraneum) and 
alfalfa {Medicago sativa). Clovers, alfalfa and medics are key pasture legumes in 
temperate climates throughout the world. 

5 Techniques for incorporating the constructs and vectors of the present 

invention into plant cells (for example by transduction, transfection or 
transformation) are known to those skilled In the art. Such techniques include 
Agmbacterium mediated introduction, electroporatlon to tissues, cells and 
protoplasts, protoplast fusion, injection into reproductive organs, injection into 

10 immature embryos and high velocity projectile Introduction to cells, tissues, calll, 
immature and mature embryos. The choice of technique will depend largely on the 
type of plant to be transformed. 

Cells incorporating the constructs and vectors of the present invention may 
be selected, as described above, and then cultured in an appropriate medium to 
15 • regenerate transformed plants, using techniques well known in the art. The culture 
conditions, such as temperature, pH and the like, will be apparent to the person 
skilled in the art. The resulting plants may be reproduced, either sexually or 
asexually, using methods well known in the art:, to pioduce successive generations 
of transformed plants. 

20 In a further aspect of the present Invention there is provided a plant cell, 

plant, plant seed or other plant part, including, e.g. transfonned with, one or more 
constructs, vectors, nucleic acids or - nucleic acid fragments of the present 
invention. 

The plant cell, plant, plant seed or other plant part may be from any suitable 
25 species, including monocotyledons, dicotyledons and gymnosperms. In a 
prefenred embodiment the plant cell, plant, plant seed or other plant part may be 
from a monocotyledon, preferably a grass species, more preferably a ryegrass 
{Lolium species) or fescue {Festuca species), more preferably perennial ryegrass, 
including both forage- and turf-type cultivars. In an alternate prefenred embodiment 
30 the plant cell, plant, plant seed or other plant part may be from a dicotyledon. 
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' preferably forage legume species such as clovers {TrifoHum species) and medics 
{Medicago species), more preferably white clover {TrifoHum repens), red clover 
(TrifoHum pratense), subterranean clover {TrifoHum subterraneum) and alfalfa 
{Medicago sativa). 

5 The present invention also provides a plant, plant seed or other plant part, 

or a plant extract derived from a plant cell of the present invention. 

The present invention also provides a plant, plant seed or other plant part, 
or a plant extract derived from a plant of the present invention. 

In a further aspect of the present invention there is provided a method of 
10 modifying organic acid biosynthesis; of nrradifying organic add secretion; of 
modifying phosphorous and other nutrients acquisition efficiency in plants; of 
modifying aluminium and acid soil tolerance in plants; of modifying nitrogen 
fixation and nodule function, said method including introducing into said plant an 
effective amount of a nucleic acid or nucleic acid fragment, construct and/or vector 
15 according to the present invention. . 

Using the methods and products of the present invention, organic acid 
biosynthesis; organic acid secretion; phosphorous and other plant nutrient 
acquisition efficiency; aluminium and acid soil tolerance; nitrogen fixation and 
nodule function, may be increased or othenwise altered, for example by 
20 incorporating additional copies of one or more sense nucleic acids or nucleic acid 
fragments of the present invention. They may be decreased or othenvise altered, 
for example by incorporating one or more antisense nucleic acids or nucleic acid 
fragments of the present invention. 

The present invention will now be. more fully described with reference to the 
26 accompanying Examples and drawings. It should be understood, however, that the 
description following is illustrative only and should not be taken in any way as a 
restriction on the generality of the invention described above. 



In the Figures 
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Figure 1 show§ the consensus contig nucleotide sequence of LpCSa. 
Figure 2 shows the deduced amino acid sequence of LpCSa. 
Figure 3 shows the nucleotide sequences of the nucleic acid fragments 
contributing to the consensus contig sequence LpCSa. 
5 Figure 4 shows the consensus contig nucleotide sequence of LpCSb. 

Figure 5 shows the deduced amino acid sequence of LpCSb. 
Figure 6 shows the nucleotide sequences of the nucleic acid fragments 
contributing to the consensus contig sequence LpCSb. 

Figure 7 shows the nucleotide sequence of LpCSc. 
10 Figure 8 shows the deduced amino acid sequence of LpCSc. 

Figure 9 shows the nucleotide sequence of LpCSd. 
Figure 10 shows the deduced amino acid sequence of LpCSd. 
Figure 11 shows the consensus contig nucleotide sequence of LpMDHa. 
Figure 12 shows the deduced amino acid sequence of LpMDHa. 
15 Figure 13 shows the nucleotide sequences of the nucleic acid fragments 

contributing to the consensus contig sequence LpMDHa. 

Figure 14 shows the consensus contig nucleotide sequence of LpMDHb. 
Figure 1 5 shows the deduced amino acid sequence of LpMDHb. 
Figure 16 shows the nucleotide sequences of the nucleic acid fragments 
20 contributing to the consensus contig sequence LpMDHb. 

Figure 1 7 shows the nucleotide sequence of LpMDHc. 
Figure 18 shows the deduced amino acid sequence of LpMDHc. 
Figure 19 shows the nucleotide sequence of LpMDHd. 
Figure 20 shows the deduced amino acid sequence of LpMDHd. 
25 Figure 21 shows the nucleotide sequence of LpMDHe. 

Figure 22 shows the deduced amino acid sequence of LpMDHe. 
Figure 23 shows the consensus contig nucleotide sequence of LpMDHf. 
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Figure 24 shows the deduced amino acid sequence of LpMDHf. 
Figure 25 shows the nucleotide sequences of the nucleic acid fragments 
contributing to the consensus contig sequence LpMDHf. 

Figure 26 shows the nucleotide sequence of LpMDHg. 
Figure 27 shows the deduced amino acid sequence of LplS/IDHg. 
Figure 28 shows the consensus contig nucleotide sequence of LpMDHh. 
Figure 29 shows the deduced amino acid sequence of LpMDHh. 
Figure 30 shows the nucleotide sequences of the nucleic acid fragments 
contributing to the consensus contig sequence LpMDHh. 

Figure 31 shows the nucleotide sequence of LpMDHi. 
Figure 32 shows the deduced amino acid sequence of LpMDHi. 
Figure 33 shows the nucleotide sequence of LpMDHj. 
Figure 34 shows the deduced amino acid sequence of LpMDHj. 
Figure 35 shows the consensus contig nucleotide sequence of LpMDHk. 
Figure 36 shows the deduced amino acid sequence of LpMDHk. 
Figure 37 shows the nucleotide sequences of the nucleic acid fragments 
contributing to the consensus contig sequence LpMDHk. 

Figure 38 shows the nucleotide sequence of LpMDHi. 
Figure 39 shows the deduced amino acid sequence of LpMDHi. 
Rgure 40 shows the nucleotide sequence of LpMDHm. 
Figure 41 shows the deduced amino acid sequence of LpMDHm. 
Figure 42 shows the nucleotide sequence of LpPEPCa. 
Figure 43 shows the deduced amino acid sequence of LpPEPCa. 
Figure 44 shows the consensus contig nucleotide sequence of LpPEPCb. 
> Figure 45 shows the deduced amino acid sequence of LpPEPCb. 

Figure 46 shows the nucleotide sequences of the nucleic acid fragments 
contributing to the consensus contig sequence LpPEPCb. 
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Figure 47 shows the nucleotide sequence of LpPEPCc. 
Figure 48 shows the deduced amino acid sequence of LpPEPCc. 
Figure 49 shows the nucleotide sequence of LpPEPCd. 
Figure 50 shows the deduced amino acid sequence of LpPEPCd. 
5 Figure 51 shows the nucleotide sequence of LpPEPCe. 

Figure 52 shows the deduced amino acid sequence of LpPEPCe. 
Figure 53 shows the nucleotide sequence of LpPEPCf. 
Figure 54 shows the deduced amino acid sequence of LpPEPCf. 
Figure 55 shows the consensus contig nucleotide sequence of TrMDHa. 
1 0 Figure 56 shows the deduced amino acid sequence of TrMDHa. 

Figure 57 shows the nucleotide sequences of the nucleic acid fragments 
contributing to the consensus contig sequence TrMDHa. 

Figure 58 shows the consenisus contig nucleotide sequence of TrMDHb. 
Figure 59 shows the deduced amino acid sequence of TrMDHb. 
15 Figure 60 shows the nucleotide sequences of the nucleic acid fragments 

contributing to the consensus contig sequence TrMDHb. 

Figure 61 shows the consensus contig nucleotide sequence of TrMDHc. 
Figure 62 shows the deduced amino acid sequence of TrMDHc. 
Figure 63 shows the nucleotide sequences of the nucleic acid fragments 
20 contributing to the consensus contig sequence TrMDHc. 

Figure 64 shows the consensus contig nucleotide sequence of TrMDHd, 
Figure 65 shows the deduced amino acid sequence of TrMDHd. 
Figure 66 shows the nucleotide sequences of the nucleic acid fragments 
contributing to the consensus contig sequence TrMDHd. 
25 Figure 67 shows the consensus contig nucleotide sequence of TrMDHe. 

Figure 68 shows ttie deduced amino acid sequence of TrMDHe. 
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Figure 69 shows the nucleotide sequences of the nucleic acid fragments 
contributing to the consensus contig sequence TrMDHe. 

Figure 70 shows the consensus contig nucleotide sequence of TrMDHf. 
Figure 71 shows the deduced amino acid sequence of TrMDHf. 
5 Figure 72 shows the nucleotide sequences of the nucleic acid fragments 

contributing to the consensus contig sequence TrMDHf. 

Figure 73 shows the consensus contig nucleotide sequence of TriVIDHg. 
Figure 74 shows the deduced amino acid sequence of TrIS/IDHg. 
Figure 75 shows the nucleotide sequences of the nucleic acid fragments 
1 0 contributing to the consensus contig sequence TrMDHg. 

Figure 76 shows the consensus contig nucleotide sequence of TrIVIDHh. 
Figure 77 shows the deduced amino acid sequence of TriVIDHh. 
Figure 78 shows the nucleotide sequences of the nucleic acid fragments 
contributing to the consensus contig sequence TriS/IDHh. 
1 5 Figure 79 shows the consensus contig nucleotide sequence of Triy/lDHi. 

Figure 80 shows the deduced amino acid sequence of TriVIDHi. 
Figure 81 shows the nucleotide sequences of the nucleic acid fragments 
contributing to the consensus contig sequence TrMDHi. 

Figure 82 shows the nucleotide sequence of Tri\4DHj. 
20 Figure 83 shows the deduced amino acid sequence of Triy^DHj. 

Figure 84 shows the nucleotide sequence of Tri^DHk. 
Figure 85 shows the deduced amino acid sequence of TriS/IDHk. 
Figure 86 shows the consensus contig nucleotide sequence of Tri^EPCa. 
Figure 87 shows the deduced amino acid sequence of TrPEPCa. 
25 Figure 88 shows the nucleotide sequences of the nucleic acid fragments 

contributing to the consensus contig sequence TrPEPCa. 

Figure 89 shows the consensus contig nucleotide sequence of TrPEPCb. 
Figure 90 shows the deduced amino acid sequence of TrPEPCb. 
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Figure 91 shows the nucleotide sequences of the nucleic acid fragments 
contributing to the consensus contig sequence TrPEPCb. 

Figure 92 shows the consensus contig nucleotide sequence of TrPEPCc. 
Figure 93 shows the deduced amino acid sequence of TrPEPCc. 
5 Figure 94 shows the nucleotide sequences of the nucleic acid fragments 

contributing to the consensus contig sequence TrPEPCc. 

Figure 95 shows the nucleotide sequence of TrPEPCd. 
Figure 96 shows the deduced amino acid sequence of TrPEPCd. 
Figure 97 shows the nucleotide sequence of TriPEPCe. 
1 0 Figure 98 shows the deduced amino acid sequence of TrPEPCe. 

Figure 99 shows the consensus contig nucleotide sequence of TrCSa, 
Figure 100 shows the deduced amino add sequence of TrCSa. 
Figure 101 shows the nucleotide sequences of the nucleic acid fragments 
contributing to the consensus contig sequence TrCSa. 
1 5 Figure 1 02 shows the consensus contig nucleotide sequence of TrCSb. 

Figure 103 shows the deduced amino acid sequence of TrCSb. 
Figure 104 shows the nucleotide sequences of the nucleic acid fragments 
contributing to the consensus contig sequence TrCSb. 



20 EXAMPLE 1 

Preparation of cDNA libraries, isolation and sequencing of cDNAs coding for 
OS, CS-Iil^e, MDH, IWIDH-like, PEPC and PEPC-illce proteins from white clover 
(TiUbllum mpens) and perennial ryegrass {Lolium persnne) 

25 cDNA libraries representing mRNAs from various organs and tissues of 

white clover {Trifolium repens) and perennial ryegrass {Lolium perenne) were 
prepared. The characteristics of the white clover and perennial ryegrass libraries, 
respectively, are described below (Tables 1 and 2). 
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TABLE 1 

cDNA libraries from white clover (Trifollum repens) 



Library 


Organ/Tissue 


01 wc 


Whole seedling, light grown : 


02wc 


Nodulated root 3, 5, 10. 14. 21 &28 day old seedling 


ViOWw 


Nodules pinched off roots of 42 day old rhizobium inoculated wc 


04wc 


Nodulated wc cut leaf and stem collected after 0. 1. 4, 6 &14 h after 

puttina 




Non-nodulated Inflorescences: <50% open, not fully open and fully 
open 


06wc 


Dark grown etiolated 


07wc 


Inflorescence - very early stages, stem elongation, < 15 petals. 15-20 
petals 


08wc 


seed frozen at -80°C, imbibed In dark overnight at 1 0°C 


09wc 


Drought stressed plants 


10wc 


AMV Infected leaf 


11wc 


WCMV infected leaf 


12wc 


Phosphorus starved plants 


13wc 


Vegetative stolon tip 


14wc 


stolon root initials 


15wc 


Senescing stolon 


16wc 


Senescing leaf 
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TABLE 2 

cDNA libraries from perennial ryegrass (Lolium perenne) 



1 Library 1 


M /Tied 1 A 1 

urgBfif 1 issue i 


01rg 


Roots from 3-4 day old light-grown seedlings 


02rg 


Leaves from 3-4 day old light-grown seedlings 


03rg 


Etiolated 3-4 day old dark-grown seedlings | 


04rg 


Whole etiolated seedlings (1 -5 day old and 1 7 days old) 1 


05rg 


Senescing leaves from mature plants 


06rg 


Whole etiolated seedlings (1 -5 day old and 1 7 days old) 


07rg 


Roots from mature plants grown in hydroponic culture 1 


08rg 


Senescent leaf tissue 


09rg 


Whole tillers and sliced leaves (0. 1. 3. 6. 12 and 24 h after 
harvesting) 


10rg 


Embryogenic suspension-cultured cells 1 


11rg 


Non-embryogenic suspension-cultured cells 1 


12rg 

1 


Whole tillers and sliced leaves (0. 1. 3. 6, 12 and 24 h after 
harvesting) 


13rg 


Shoot apices including vegetative apical meristems j 


14rg 


Immatijre inflorescences including different stages of inflorescence 
meristem and inflorescence development 


15rg 


Defatted pollen 1 


16rg 


Leaf blades and leaf sheaths (rfecL, rt)cS, cab, vwf2A subtracted) 


17rg 


Senescing leaves and tillers 


18rg 


Drought-stressed tillers (pseudostems from plants subjected to PEG- 
simulated drought stress) 
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Library 


Organ/Tissue 


19rg 


Non-embryogenic suspenslon-cuttured cells subjected to osmotic 
stress (grown in media with half-strength salts) 
(1 , 2, 3. 4, 5. 6, 24 and 48 h after transfer) 


20rg 


Non-embryogenIc suspension-cultured cells subjected to osmotic 
stress (grown in media with double-strength salts) 
(1 . 2. 3, 4. 5. 6, 24 and 48 h after transfer) 


21rg 


Drought-stressed tillers (pseudostems from plants subjected to PEG- 
simulated drought stress) 


22rg 


Spikelets with open and maturing florets 


23rg 


Mature roots (specific subtraction with leaf tissue) 



The cDNA libraries may be prepared by any of many methods available. 
For example, total RNA may be isolated using the Trizol method (Gibco-BRL. 
USA) or the RNeasy Plant Mini kit (Qiagen. Germany), following the 
5 manufacturers' Instructions. cDNAs may be generated using the SMART PCR 
cDNA synthesis kit (Clontech. USA), cDNAs may be amplified by long distance 
polymerase chain reaction using the Advantage 2 PCR Enzyme system (Clontech. 
USA), cDNAs may be cleaned using the GeneClean spin column (Bio 101. USA), 
tailed and size fractionated, according to the protocol provided by Clontech. The 
10 cDNAs may be introduced into the pGEM-T Easy Vector system 1 (Promega. 
USA) according to the protocol provided by Promega. The cDNAs in the pGEM-T 
Easy plasmid vector are transfected Into Escherichia coli Epicurian coll XLIO-Gold 
ultra competent cells (Stratagene. USA) according to the protocol provWed by 
Stratagene. 

15 Alternatively, the cDNAs may be introduced into plasmid vectors for first 

preparing the cDNA libraries in Uni-ZAP XR vectors according to the 
manufacturer's protocol (Stratagene Cloning Systems. La Jolla. CA. USA). The 
Uni-ZAP XR libraries are converted into plasmid libraries according to the protocol 
provided by Stratagene. Upon conversion, cDNA inserts will be contained in the 
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plasmid vector pBluescript. lii addition, the cDNAs may be Introduced directly into 
precut pBluescript II SK(+) vectors (Stratagene) using T4 DNA ligase (New 
England Biolabs), followed by transfection into E. coli DHIOB cells according to 
the manufacturer's protocol (GIBCO BRL Products). 

5 Once the cDNA inserts are in plasmid vectors, plasmid DMAs are prepared 

from, randomly picked bacterial colonies containing recombinant plasmids, or the 
insert cDNA sequences are amplified via polymerase chain reaction using primers 
specific for vector sequences flanking the inserted cDNA sequences. Plasmid 
DNA preparation may be performed robotically using the Qiagen QiaPrep Turbo kit 

10 (Qiagen. Germany) according to the protocol provided by Qiagen. Amplified insert 
DMAs are sequenced in dye-terminator sequencing reactions to generate partial 
cDNA sequences (expressed sequence tags or "ESTs"). The resulting ESTs are 
analyzed using an Applied Btosystems ABI 3700 sequence analyser. 

EXAMPLE 2 

1 5 DNA sequence analyses 

The cDNA clones encoding CS. CS-like. MDH. MDH-llke. PEPC and PEPC- 
like proteins were identified by conducting BLAST (Basic Local Alignment Search 
Tool; Altschul et al. (1993) J. Mol. Biol. 215:403-410) searches. The cDNA 
sequences obtained were analysed for similarity to all publicly available DNA 

20 sequences contained In the eBloinfomnatics nucleotide database using the 
BLASTn algorithm provided by the National Center for Bfotechnology Infonmatlon 
(NCBI). The DNA sequences were translated in all reading frames and compared 
for similarity to all publicly available protein sequences contained in the SWISS- 
PROT protein sequence database using BLASTx algorithm (v 2.0.1) (Gish and 

25 States (1 993) Nature Genetics 3:266-272) provided by the NCBI. 

The cDNA sequences obtained and identified were then used to identify 
additional identical and/or overiapping cDNA sequences generated using the 
BLASTN algorithm. The identical and/or overiapping sequences were subjected to 
a multiple alignment using the CLUSTALw algorithm, and to generate a consensus 
30 contig sequence derived from tills multiple sequence alignment. The consensus 
contig sequence was then used as a query for a search against the SWISS-PROT 
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protein sequence database using the BLASTx algorithm to confirm the initial 
identification. 

Finally. It is to be understood that various alterations, modifications and/or 
additions may be made without departing from the spirit of the present invention as 
outlined herein. 

Agriculture Victoria Services Pty Ltd 
AgResearch Limited 

By their Registered Patent Attorneys 

Freehills Carter Smith Beadle 10 March 2004 



re 1 Consensus contig nucleotide sequence of LpCSa 

* ' 20 * ^0 * 60 
LpCSa : Ginn?TATATTGACGGGGATGAGGGAATTCT^CGCTA^^ : 

* • 80 * 100 * 120 
LpCSa : GGCTGAAAGCAGCTCGTTTGTTGAGGTC<3CCTACCTCTTJU^TGTATGGGAA^ : 



60 



120 



* 140 * 1^0 * 180 

LpCSa : CCAGAGTCAACTGGCAGGCTGGGAGTTTGCAATTTCGCAGCACTCTGCTGTTCCTX^ : 180 



* 200 * 220 * 240 

LpCSa : ACTCTTGGATATAATACAATCAATGCCTCATGATGCCCACCCCATGGGTGTCCrro t 



* 380 * 400 * 420 

LpCSa : AGTAATAGCAGCTGCAGCCTATCTGAQATTAGCAGGAAGGCCCTTTGTCCTTCOT 



* 620 * 640 * 660 

LpCSa : CACTGCrCTTTCTGGTGCTGTTGGAGCTCTATATGGTCCACTGCATGGTGGCG 



♦ . 740 * 760 * 780 

LpCSa : GGaAGTGAAGAACAGGAAGCGQAAAATGTCTGGTTTTaQGCACCpTGTGTATAAGA^^ 



240 



* 260 * ■ 280 * 300 
LpCSa : TGCAATGAGCACACTTTCAGTCTTCCATCCAGATGCAAACCCTGCTCTTAGAGGTCAA : 300 

* 320 * 340 * 360 
LpCSa : TCTATACAAGTCGiAGCAGGTTAGGGATAAGCAAATTGTACGAGTTCT^^ s 360 



420 



* 440 * 460 * 480 

LpCSa : TAATCTCTCTTATTCAGAAAATOTCTTGTATATGdTGGACTCTATGGGTG^ : 480 



* 500 * 520 * 540 

LpCSa : TAAGGCAAATCCCAGACTTGCCCGGGTTCTGGATGTCCTTTTTATTCTO^ : 



540 



* 560 * 580 * 600 

LpCSa : CGAAATGAACTGCTCAACAGCTGCTGTTAGGCACCTTGCTTCAAQTGGTGTCGATC^ 600 



660 



* 680 * 700 * 720 

LpCSa : GGCGGTACTTAAAATGTTAAATGAGATTGGAAQTGTAGAGAATATTCCGGAAOT : 720 



780 



* • 800 * 820 * • 840 

LpCSa ; TGATCCTCGTGCTAAAGTCATCCGQAAGTTAGCGGAGGAGGTTTTCACGATTGTGC^ : 840 

* 860 * • 880 * 900 
LpCSa : GGATCCTCTTATCGAGGTAGCTQTTGCTTTGGAGAAGGCAGCACTGTCAGACGAGTATOT : 900 

* 920 * 940 * 960 
LpCSa : TATCAAGAGGAAGCTGTATCCAAATGTGGATTTTTATTCTGGCCTAATATATAGGG : 960 



»* 980 * 1000' * 1020 

Sa : GGGATTCCCTACAGAGTim^TCCCTGTrCTGTTTGC^^ : X020 



* 1040 * 1060 * 1080 

LpCSa : AGCACATTGGAAGGAGTCACTTGATGACCCCGACAATAAAATTATGAGGCCCCAACAGG : 1080 



* 1100 * 1120 *■ 1140 

LpCSa : ATAO^CCGGTACTTGGCTAAGGaVTTACACCCaVGTGAGAGAACGGGTQCC^^ s 1140 



★ 1160 * 1180 * 1200 

IipCSa : CAGTGAGCAGCTTGGGCAGATCGCTACATCAAACGCGACGAGGCQTCGGCGTGCTGGCTC : 1200 



* 1220 * 1240 * 1260 

IipCSa .: TGCCCTQTAGAACAGTCTGCATGATACAGCATACAGTCCACAC^ : 1260 



* 1280 * 1300 * 1320 
LpCSa : AGGGCCACGGCTGCTTAAATCTGGGAGCTGCTATACnTOTGTTATC^ : 1320 

* 1340 * 1360 * 1380 
LpCSa : AATAAACTAATAATGCCGCCAGGACACTTCACTGGTGGTCATGTGAAQTTGGTAGTAGAA ; 1380 



* 1400 * 1420 * 1440 
LpCSa : TGCACTTGTAACGTGTTGTTAATTTGTTATCCTGCAATGTACGCTCTATAAACTGT^ r 1440 

* 1460 * 1480 * 1500 
LpCSa s TATCXTGAAAGTCTKUITCATGTGGACCAATGAAGACATAGATCAAGTTCTTO : 1500 



* 1520 * 1540 * 

LpCSa : CGGCGGCTGTTTCTTTGGGAAAAAACTTTTTATGGGAGTCTTTTTTTACC : 1550 



^^^ure 2 Deduced amino acid sequence of LpCSa 

* 20' * 40 * 60 

LpCSa : YIDGDEGIIiRYRGyPIEBVMSSSFVEVAYIjIiMYGNLPTQSQLAGWEFAISQH : 60 



* 80 * 100 * 120 
LpCSa : IJJIIQSMPHDAHPMGVLASAMSTLSVFHPDANPALRGQDIjYKSKQV^ : 120 

* 140 * 160 * 180 
LpCSa : lAAAAYIiRLAQRPFVLPSNNIiSYSENPLYMIiDSM : 180 

* 200 * 220 * 240 
LpCSa : MNCSTAAVRHIASSGVDVFTMjSGAVGALYGPLHGGANEAVLKMI^ : 240 



* .260 * 280 * 300 

LpGSa : VKKTRKIUa^SGFGHRVYKiraDPRAICVIRK^ : 3 00 



* 320 * 340 * 360 

IjpCSa : KIUCLYPNVDFYSGIiIYiy^GFPTEFFPVLFAVPRMAGWIiAHWKESI^ : 360 



* 3*80 * 400 

LpCSa : TGTWLRHYTPVRERVPSSDSEQXiGQIATSNATRRRRAGSAIi : 401 



ure 3 Nucleotide sequences of nucleic acid fragments contributing to the 
isensuB contig secjuence LpCSa 



20 



40 



60 



LpCSal 
LpCSa2 
LpCSa3 
LpCSa4- 
LpCSaS 
LpCSa6 
LpCSa? 
LpCSaS 



GNTJTTATATTGACGGGGATGAGGGAATTCTTCGCTACAGAGGCTATCqAATTGAGGAGGT 



60 



80 



100 



120 



LpCSal 
LpCSa2 
LpCSaB 
LpCSa4 
LpCSaS 
LpCSaS 
LpCSa? 
LpCSaS 



GG'CTGAAAGCAGCTCGTTTGTTGAGGTCGCCTACCTCTTAATGTATGqGAATTTGCCCAC 



120 



140 



160 



180 



LpCSal 
LpCSa2 
LpCSa3 
LpCSa4 
LpCSaS 
LpCSa6 
LpCSa? 
LpCSaS 



CCAGAGTCAACTGGCAGGCTGGGAGTTTGCAATTTCGCAGCACTCTGGTGTTCCTCAAGG 
GCAGGCTGGGAGTTTGCAATTTCGCAlCACTCTGGTGTTCCTCASjGr 



180 
46 



200 



220 



240 



LpCSal 
LpCSa2 
LpCSaS 
LpCSa4 
LpCSaS 
LpCSa6 
LpCSa? 
LpCSaS 



ACTCTTGGATATAATACAATCAATGCCTCATGATGCCCACCCCATGGGTGTCCTTGCCAG 
ACTCTTGGATATAATACAATCAATGCCTCATGATGCCCACCCCATGGGTGTCCTTGCCAG 



240 
106 



260 



280 



300 



•LpCSal 
LpCSa2 
LpCSa3 
LpCSa4 
LpCSaS 
LpCSa6 
LpCSa? 
LpCSaS 



TGCAATGAGCACACTTTCAGTCTTCCATCCAGATGCAAACCCTGCTCIjTAGAGGTCAAGA 
TGCAATGAGCACACTTTCAGTCTTCCATCCAGATGCAAACCCTGCTCTITAGAGGTCAAGA 



300 
166 



Sal 
ljpCSa2 
LpCSa3 
IjpCSa4 
LpCSaS 
LpCSa6 
LpCSa? 
LpCSad 



320 



340 



360 



TCTATACAAGTCGAAGCAGGTTAGGGATAAGCAAiVTTGTACGAGTTCTTGGGAAGGCACC 
TCTATACAAGTCGAAGCAGGTTAGGGATAAGCAAATTGTACGAGTTCTTGGGAAGGCACC 



LpCSal 
LpCSa2 
LpCSa3 
XjpCSa4 
LpCSaS 
£jpCSa6 
LpCSa? 
LpCSaS 



380 



400 



420 



AGTAATAGCAGCTGCAGCCTATCTGAGATTAGCAGGAAGGCCUTTTG'i'CCTTCCTTC 
AGTAATAGCAGCTGCAGCCTATCTGAGATTAGCAGGAAGGCCCTTTGyCCTTCCTTCAAA 



LpCSal 
LpCSa2 
LpCSa3 
LpCSa4 
LpCSaS 
LpCSa6 
LpCSa? 
LpCSaB 



440 



460 



480 



TAATCTCTCTTATTCAGAAAATTTCTTGTATATGCTGGACTCTATGGGTGACAAAGATTA 
TAATCTCTCTTATTCAGAAAATTTCTTGTATATGCTGGACTCTATGGQTGACAAAGATTA 



500 



520 



540 



LpCSal 
LpCSa2 
LpCSa3 
LpCSa4 
LpCSaS 
LpCSa6 
LpCSa? 
LpCSaB 



TAAGCCAAATCCCAGACTTGCCCGGGTTCTGGATGTCCTTTTTATTC'IjTCATGCTGAAC 

taagccaaatcccagacttgcccgggt tctggatgtcctttttattcij tcatgctgaaca 

itJTOtg ctgBac a 



540 
406 
12 



560 



580 



600 



LpCSal 
LpCSa2 
LpCSa3 
LpCSa4 
LpCSaS 
LpCSa6 
LpCSa? 
LpCSaS 



cgaaatgaactgctcaacagctgctgttaggcaccttgcttcaagtgg,tgtcgatgtctt 

CGAAATGAACTGCTCAACAGCTGCTGTTAGGCACCTTGCTTCAAGTGGpGTCGATGTCTT 
CGAAATGAi?lCTGCTCAACAGCTGCTGTTAGGCACCTTGCTTCAAGTGG.TGTCGATGTCTT 



600 
466* 
72 



620 



640 



660 



LpCSal 
LpCSa2 
LpCSa3 
LpCSa4 
LpCSaS 
LpCSa6 
LpCSa? 
LpCSaS 



cactgctctttctggtgctgttggagctctatatggtccactgcatggsiggcgcaaatga 
cactgctctttctggtgctgttggagctctatatggtccactgcatggjtggcgcaaatga 
cactgctctttctggtgctgttggagctctatatggtccactgcatggItggcgcaaatga 



660 
526 
132 



CSal 
LpCSa2 
LpCSaS 
ljpCSa4 
LpCSaS 
LpCSa6 
LpCSa7 
LpCSaS 



680 



700 



720 



SgcggtacttBaaatgttaaatgagattggaagtgtagagaatattcgggaattcattga 

GGCGGTACTTAAAATGTTAAATGAGATTGGAAGTGTAGAGAATATTCGGGAATTCATTGA 
GGCGGTACTTAAAATGTTAAATGAGATTGGAAGTGTAGAGAATATTCqGGAATTCATTGA 



719 
586 
192 



LpCSal 
LpCSa2 
LpCSa3 
LpCSa4 
LpCSaS 
LpCSa6 
IipCSa7 
XipCSaS 



740 



760 



7B0 



GGGAGTGAAGAACAGGAAGCGGAAAATGTCTGGjSTTTGGGCAq 
GGGAGTGAAGAACAGGAAGCGGAAAATGTCTGGTTTTGGGCACCGTG1GTATAAGAATTA 
GGGAGTGAAGAACAGGAAGCGGAAAATGTCTGGBTTTGGGCACCGTGOj'GTATAAGAATTA 



763 
646 
252 
2 



LpCSal 
LpCSa2 
IipCSaB 
IjpCSa4 
LpCSaS 
LpCSa6 
LpCSa7 
LpCSaS 



800 



820 



840 



TGATCCTCGTGCTAAAGTCATCCGGAAGTTAGCGGi 
TGATCCTCGTGGTAAAGTCATCCGGAAGTTAGCGGAGGAGGTTTTCAGGATTGTGGGACG 



tHatcctcgBgctaaagtcat 



"cSgEagttagcggaggaggttttcaggattgtgggacg 
"ggaagttagcggaggaggttttcaggattgtgggacg 



682 
312 
61 
37 



LpCSal 
LpCSa2 
LpCSa3 
LpCSa4 
LpCSaS 
LpCSa6 
LpCSa7 
LpCSa8 



860 



880 



900 



ggatcctcttatcgaggtagctgttgctttggagaaggsagcactgtdagacgagtattt 
ggatcctcttatcgaggtagctgttgctttggagaaggcagcactgtgagacgagtattt 



3GiSTCCTCTTATCGAGGTAGCTGTTGCTTTGGAGAAGGCAGCACTGTaAGACGAGTATTT 

Ix f^ QAGACGAGTATTT 
GTqAGACGAGTATTT 



372 
121 
97 
16 
15 



LpCSal 

LpCSa2 

LpCSa3 

LpCSa4 

LpCSaS 

LpCSa6 

LpCSa7* 

LpCSa8 



920 



940 



960 



TATCAAGAGGAAGCTGTATCCAAATGTGGATTTTTATTCTGGCCTAA'l|ATATAGGGCAATi 
TATCAAGAGGAAGCTGTATCCAAATGTGGATTTTTATTCTGGCCTAAljATATAGGGCAAT 
TATCgAGAGGAAGCTGTATCCAAATGTGGATTTTTATTCTGGCCTAAT|ATATAGGGCAAT 
TATCAAGAGGAAGCTGTATCCAAATGTGGATTTTTATTCTGGCCTAATjATATAGGGCAAT 
TATCAAGAGGAAGCTGTATCCAAATGTGGATTTTTATTCTGGCCTAAT|ATATAGGGCAAT 



980 



1000 



1020 



LpCSal : 
LpCSa2 
LpCSa3 
LpCSa4 
LpCSaS 
LpCSa6 
LpCSa7 
LpCSaS 



GGGATTCCCTACAGAGTTTTTCCCTGTTCTGTTTGCAGTTCCTCGCAljGGCTGGTTGGTT 
GGGATTCCCTgCAGAGTTTTTCCCTGTTCTGTTTGCAGTTCCTCGCAT,GGCTGGTTGGTT 
GGGATTCCCTACAGAGTTTTTCCCTGTTCTGTTTGCAGTTCCTCGCATpGCTGGTTGGTT 
GGGATTCCCTACAGAGTTTTTCCCTGTTCTGTTTGCAGTTCCTCGCATpGCTGGTTGGTT 
GGGATTCCCTACAGAGTTTTTCCCTGTTCTGTTTGCAGTTCCTCGCATjGGCTGGTTGGTT 



492 
241 
217 
136 
135 



m 



Sal 
LpCSa2 
I.pCSa3 
LpCSa4 
LpCSaS 
IjpCSa6 
LpCSa? 
LpCSaS 



1040 



1060 



1080 



AGCACATTGGAAGGAGTCACTTGATGACCCCGACAATAAAATTATGAGGCCCCAACAGGT 
AGCACATTGGAAGGAGTCACTTGATGACCCCGACAATAAAATTATGAC^GCCCCAACAGGT 
AGCACATTGGAAGGAGTCACTTGATGACCCCGACAATAAAATTATGAGGCCCCAACAGGT 
AGCACATTGGAAGGAGTCACTTGATGACCCCGACAATAAAATTATGAGGCCCCAACAGGT 
AGCACATTGGAAGGAGTCACTTGATGACCCCGACAATAAAATTATGAqGCCCCAACAGGT 



552 
301 
277 
196 
195 



1100 



1120 



1140 



LpCSal 
LpCSa2 
LpCSaS 
LpCSa4 
IipCSaS 
LpCSa6 
LpCSa7 
LpCSaS 



ATACACCGGTACTTGGCTAAGGCATTACACCCCAGTGAGAGAACGGGTGCCATCAAGCGA 
ATACACCGGTACTTGGCTAAGGCATTACACCCCAGTGAGAGAACGGGTGCCATCAAGCGA 
ATACACCGGTACTTGGCTAAGGCATTACACCCCAGTGAGAGAACGGGTGCCATCAAGCGA 
ATACACCGGTACTTGGCTAAGGCATTACACCCCAGTGAGAGAACGGGTGCCATCAAGCGA 
ATACACCGGTACTTGGCTAAGGCATTACACCCCAGTGAGAGAACGGGTGCCATCAAGCGA 



612 

361- 

337 

256 

255 



1160 



1180 



1200 



LpCSal 
LpCSa2 
LpCSa3 
LpCSa4 
LpCSaS 
LpCSa6 
LpCSa7 
LpCSa8 



CAGTGAGCAGCTTGGGCAGATCSCTACATCAAACGCGACGAGGCGTCGGCGTGCTGGOTC 
CAGTGAGCAGCTTGGGCAGATCGCTACATCAAACGCGACGAGGCGTCGGCGTGCTGGCTC 

cagtgagcagcttgggcagatcgctacatcaaacgcgacgaggcgtcggcgtgctggctc 
cagtgagcagcttgggcagatcgctacatcaaacgcgacgaggcgtcggcgtgctggctc 
cagtgagcagcttgggcagatcgctacatcaaacgcgacgaggcgtcggcgtgctggctc 
ggcagatcgctHcatcaaacgcgBcgaggcgtcggcgtgctggctc 



672 
421 
397 
316 
315 
45 



1220. 



1240 



1260 



LpCSal 
LpCSa2 
LpCSaS 
LpCSa4 
LpCSaS 
LpCSa6 
LpCSa7 
LpCSaS 



TGCCCTGTAGAACAGTCTGCATGATACAGCATACAGTCCACACAATA^jACCAAGCTGCCA 

tgccctgtagaacagtctgcatgatacagcatacagtccacacaataaaccaagctgcca 
tgccctgtagaacagtctgcatgatacagcatacagtccacacaataaaccaagctgcca 
tgccctgtagaacagtctgcatgatacagcatacagtccacacaataaaccaagctgcca 
jtgccctgtagaacagtctgcatgatacagcatacagtccacacaataaaccaagctgcca 
itgccctgtagaacagtctgcatgatacagcatacagtccacacaataiv^ccaagctgcca 



732 
481 
457 
376 
375 
105 



* 1280 * 1300 * 1320 

LpCSal : 

LpCSa2 

LpCSaS : . \3b3^dsblM^^9liiM^^fij^Si^ 
LpCSa4 
LpCSaS 
LpCSa6' 
LpCSa7 
LpCSa8 



agggccacggctgcttaaati 

agggccacggctgcttaaatctgggagctgctatacttgtgttatcaggtatatjcjtaggc 
agggccacSgctgcttaaatctgggagctgctatacttgtgttatcaggtatatataggc 
agggccacggctgcttaaatctgggagctgctatacttgtgttatcaggtatatataggc 
agggccacggctgcttaaatctgggagctgctatacttgtgttatcaggtatatataggc 

lAGGGCCACGGCTGCTTAAATCTGGGAGCTGCTATACTTGTGTTATCAGGTATATATAGGC 



753 
541 
517 
436 
435 
165 



LpCSal 
LpCSa2 
LpCSaS 
LpCSa4 
LpCSaS 
LpCSa6 
LpCSa7 
LpCSa8 



1340 



1360 



1380 



AATAAACTAATAATGCCGCCAGGACACTTCACTGGTGGTCATGTGAAdTTGGTAGTAGAA 
AATAAACTAATAATGCCGCCAGGACACTTCACTGGTGGTCATGTGAAGTTGGTAGTAGAA 
AATAAACTAATAATGCCGCCAGGACACTTCACTGGTGGTCATGTGAAdTTGGTAGTAGAA 
AATAAACTAATAATGCCGCCAGGACACTTCACTGGTGGTCATGTGAAdTTGGTAGTAGAA 
AATAAACTAATAATGCCGCCAGGACACTTCACTGGTGGTCATGTGAAdTTGGTAGTAGAA 




1400 



1420 



1440 



Sal 



LpCSa2 
LpCSaS 
LpCSa4 
LpCSaS 
LpCSa6 
LpCSa? 
XipCSaS 



TGCACTTGTAACGTGTTGTTAATTTGTTATCCTGCAATGTACGCTCT7|.TAAACTGTTCAG 
TGCACTTGTAACGTGTTGTTAATTTGTTATCCTGCAATGTACGCTCT/^TAAACTGTTCAG 
TGCACTTGTAACGTGTTGTTAATTTGTTATCCTGCAATGTACGCTCTATAAACTGTTCAG 
TGCACTTGTAACGTGTTGTTAATTTGTTATCCTGCAATGTACGCTCTij.TAAACTGTTCAG 
TGCACTTGTAACGTGTTGTTAATTTGTTATCCTGCAATGTACGCTCTATAAACTGTTCM 



661 
637 
556 
555 
285 



1460 



1480 



1500 



LpCSal : 
LpCSa2 
LpCSaB 
IjpCSa4 
LpCSa5 
LpCSa6 
LpCSa? 
LpCSa8 



T^TCTTGAAAGTCTTAA' 

tatcttgaaagtcttaS' 
tatcttgaaagtcttaa' 

TATCTTGAAAGTCTTAA' 
iTATCTTGAAAGTCTTAAI 




GAAGACATAGATCAAGTTCTTTGCATGGG 



G AAG AC AT AG ATC AAGTTCTTTG C ATGGG 
~ " ^Af3A5A5A' 



720 
666 
615 
597 
310 



LpCSal 
LpCSa2 
LpCSa3 
LpCSa4 
LpCSa5 
LpCSa6 
XipCSa7 
LpCSaS 



1520 



1540 



cggcggctgtttctttggSjaaaaaa 



CGGCGGCTGTTTCTTTG 



T@TTTCC1C 



CTTTTTATGGGAGTCTTTTTTTACC 



745 
665 



^^^re 4 Consensus contig nucleotide sequence of Lj^CSb 

* 20 * 40 * 60 
LpCSb : CTTCrCCCTGTN^ACTGCTCTCCAATGACAOVGTOT : 60 

* 80 * iOO * 120 
LpCSb : TTGAGAGTGAATTTGGAAAGGCTTATGAGAAGGGAATCOVTAAATa^AAGTT^ : 120 



* 140 * 160 * 180 

lipCSb : CTACATATQAAGATAGCTTAAATTTOATTGCTCGGOTTCCACAAGTGGC^ : 180 



★ 200 * . 220 * 240 

LpCSb : ACCGGAGAATTTTCAAGGACGGGAAAACTATTGO^GCrGATAATACACT^ J 240 



* 260 * 280 * 300 
LpCSb : CTAATTTTTCy^CTVCATGCTTGGTTTTGATGACCCCAAAATGCTGGAGTTGATGCGCCTAT : 300 

* 320 * 340 * 360 
LpCSb ; ACaVTAACAATTCAOVCTGATCACGAAGGAGGQAATGTTAGTGCTCATGCTGGG^ : 360 



* 380 * 400 * 420 
LpCSb : TTGGAAGTGCTCTGTCAGATCCTTATCTTTCrrTTO s 420 

* 440 * 460 * 480 
LpCSb : GACCACTGCACGGCTTGGCTAATCAGGAAGTGTTGTNATGGATCAAATCTGTGATGGAAG : 480 

* 500 * 520 * 540 
LpCSb : AAAGCGGGAGTAACATTACAACTGATCAGCTTAAAGAATATGTTTGGAAGACACTGAAGA : 540 



* 560 * 580 * 600 

LpCSb : GTGGAAAGGTTGTTCCTGGCTATGQTa^TGGAGTTCrACGTAATACAGATCCACGATA : 600 



* 620 * 640 * 660 

LpCSb : CGTGCCAAAGGGAGTTTGCACTGAAGTATTTACCCGAAGACCCACTTTTCCAACTGGTCT : 660 



* ; 680 * 700 * 720 
LpCSb : CCAAOTTGTACGJMWSTTGTGCCTCCTATCCTCACCGAGTTAGGCAAGGTAAAA^^ : 720 

* 740 * 760 * 780 
LpCSb : GGCCTAATGTTGATGCTCACAGTGGAGTTTTGCTCAACCACTTC : 780 



* 800 * 820 * 840 

LpCSb : GGTACTACACTGTCTTGTTCGGCGTCTCAAGGAGCATGGGAATTGGATCTCAGCTCATTT : 840 



* 860 * 880 * 900 

LpCSb : GGGACCGTGCCCTCGGCCTGCCACTTOAAAGACCGAAGAGTGTCACCATGGAGTGGCTGG : 900 



* 920 * '940 * 960 

LpCSb ! AAAACCACTGCAAGAAGGCTGCGGCCTGAAGCTACACCAATGCTTCGTOT : 960 



• * 980 ♦ 1000 * 1020 

Sb : GCCGTCn-TTGATGTTAATAATGACTGAQCa^TAAGTTAGGCA^^ s 1020 

* 1040 * 1060 * 1080 
LpCSb : ATCTTCGTraTCCTGGCCAATAACTGGAGCJ^GAGGCTCACAGACGGTAGAATT^ : 1080 

* 1100 * 1120 * 1140 
LpCSb : CCACCGOTACrrGAACACCGAATCABrra H^O 

1^ 1160 
LpCSb : GACACATAAGTTTTATGTGTCGCTCGG : 1167 



^^^re 5 Deduced amino acid sequence of LpCSb . 

* 20 '* 40 * 60 

LpCSb : SPCXCSPMTQFTTGVMALQVESEPAKAYEKGIHKSKFWEPTYEDSliOTiIAI^ i 



* 80 * 100 * 120 

LpCSb : RRIFKDGKTIAADNTIiDYAANFSHMLGPDDPKMLELMIU^yiTIHTDH^ : 



* 140 * 160 * 180 

LpCSb : QSALSDPYLSFAAALNGLAGPLHGLANQEVLXWIKSVMBBTGSNITTDQLKBYWK^ : 



* 200 * 220 * 240 

LpCSb : GKNn^GYGHGVLRNTDPRYSCQRBPALKYLPEDPLPQLVSKLYEVVPPILTBLGKVKOT : 



* 260 * 280 * 300 

LpCSb : pWlDAHSGVLLNHFGLVEARYYTVLFGVSRSMGIGSQLIWDIUajGLPLiiaiPKSVTMEWLE : 



LpCSb : NHCKKAAA : 308 



s 



re 6 Nucleotide sequences of nucleic acid fragments contributing to the 
Bensus contlg sequence LpCSb 



20 



40 



60 



LpCSbl : 

LpCSb2 : 

LpCSbS : 

IipCSb4 : 



rT^CTCCCTGTMACTGCTCTCCAATGACACAGTTTACCACTGGAGTGA|TGGCACTCGAAG 



60 



100 



120 



LpCSbl 
I,pCSb2 
IipCSb3 
IipCSb4 



^T-rz7ir:!nnT-na2XTTTr::rAAAC:;nr!TTATGAGAAGGGAATTGATAA ATCAAiAGTTCTGGGAGC 



120 



140 



160 



180 



LpCSbl 
LpCSb2 
LpCSb3 
IipCSb4 



rTACATATGAAGATAGCTTAAATTTGATTGCT.CGGCTTCCACAAGTGqCTTCATATGT.TT 



180 



200 



220 



240 



LpCSbl 
I*pCSb2 
LpCSb3 
IipCSb4 



&r<r-P!nAGAATTTTCAAGGACGGGAAAACTATTGCAGCTGATAATACAqTGGACTACGCAG 



240 



260 



280 



300 



lipCSbl 
XipCSb2 
IipCSb3 
IipCSb4 



PTAATTTTTCACACATGCTTGGTTTTGATGACCCCAAAATGCTGGAGT|TGATGCGCCTAT 



300 



320 



340 



360 



LpCSbl 
IipCSb2 
LpCSbS 
ItpCSb4 



ArATAArAATTrACACTGATCACGAAGGAGGGAATGTTAGTGCTCATGpTGGGCATCTGG 



3 60 



380 



400 



420 



LpCSbl 
LpCSb2 
IipCSb3 
IipCSb4 



TTGGAAGTGCTCTGTCAGATCCTTATCTTTCTTTTGCAGCGGCACTGAkCGGTTTAGCTG 



420 



440 



460 



480 



LpCSbl 
LpCSb2 
LpCSb3 
LpCSb4 



GACCACTGCACGGCTT GGCTAATCAGGAAGTGTTG TBATGGATCAAATpTGTGATGGAAG 

^ ~ tmatggatBTiaat.ctgtgatggaag 



480 
24 



500 



520 



540- 



LpCSbl 
LpCSb2 
LpCSb3 
LpCSb4 



AAACCGGGAGTAACATTACAACTGATCAGCTTAAAGAATATGTTTGGAkGACACTGAAGA 
AACCGGGAGTAACATTACAACTGATCAGCTTAAAGAATATGTTTGGAfGACACTGAAGA 



CTGAAGA 



540 
83 
7 



* 560 



GTGGAAAGGTTGTTCCTGGCTATGGTCATGGAGTTCTACGTAATACAGATCCACGATACT 
GTGGAAAGGTTGTTCCTGGCTATGGTCATGGAGTTCTACGTAATACAGATCCACGATACT 
GTGGAAAGGTTGTTCCTGGCTATGGTCATGGAGTTCTACGTAATACAGATCCACGATACT 



LpCSb2 
LpCSb3 
LpCSb4 



CGTGCCAAAGGGAGTTTGCACTGAAGTATTTACCUGAAGACCCACTTTTCCAACTGGTCT 
CGTGCCAAAGGGAGTTTGCACTGAAGTATTTACCCGAAGACCCACTTTTCCAACTGGTCT 

cgtgccaaagggagtSEgiSactgaagtatttacccgaagacccacttttccaactggtct 



ccaagttgtaBgaagttgtgcctcctatcctcacSgagttaggcaaggtaaaaaacccat 
ccaagttgtacgaagttgtgcctcctatcctcaccgagttaggcaagcptaaaaaacccat 

CCAAGTTGTACGAAGTTGTGCCTCCTATCCTCACCGAGTTAGGCAAGC^TAAAAAACCCAT 



LpCSbl 
LpCSb2 
LpCSb3 
LpCSb4 



LpCSbl 
LpCSb2 
LpCSb3 
LpCSb4 



ggcctaatgttgatgctcacagiSggagttttgctcaaccagttcggattagttgaaBcac 
g^cctaatgttgatgctcacagtggagttttgctcaaccacttcggattagttgaagcac 
ggcctaatgttgatgctcacagtggagttttgctcaaccacttcggattagttgaagcac 



LpCSbl 
LpCSb2 
LpCSb3 
LpCSb4 



LpCSbl 
LpCSb2 
LpCSb3 
LpCSb4 



800 



820 



840 




802 
383 

307 
22 



LpCSbl 
LpCSb2 
LpCSb3 
LpCSb4 



860 



880 



900 



gggaccgtgccctcggcctgccacttgaaagaccgaagagtgtcacc;i.tggagtggctgg 
gggaccgtgccctcggcctgccacttgaaagaccgaagagtgtcaccatggagtggct.gg 
gggSccgtgccctcggcctgccacttgaaagaccgaagagtgtcacc^^tggagtggctgg 



LpCSbl 
LpCSb2 
LpCSb3 
LpCSb4 



920 



940 



960 



aaaaccactgcaagaaggctgcggcctgaagctacaccaatgcttcgttttacaaatcag 

aaaaccactgcaagaaggctgcggcctgaagctacaccaatgcttiSgttttacaaatca! 

aaaaccactgcaagaaggctgcggcctgaagctacaccaatgcttcgttttacaaatcag 



LpCSbl 
LpCSb2 
LpCSb3 
LpCSb4 



980 



1000 



1020 



gccgtctttgatgttaataatgactgagcataagttaggcatggttagccttgttttacc 
gccgtctttgatgttaataatgactgagcataagttaggcatggStagccttgttttacc 
gccgtctttgatgttaataatgactgagcataagttaggcatggttagcct tgttttacc 



LpCSbl 
LpCSb2 
LpCSb3 
LpCSb4 



1040 



1060 



1080 



atcttcgttttcctggccaataactggagcaagaggctijacagacggtagaattttgtaa 
atSjttcgttttcctggccaataactggagcaagaggctcacagacggtagaattttgtaa 
atcttcgttttcctggccaataactggagcaagaggctcacagacggtagaattttgtaa 



1100 



1120 



1140 



fCSbl 
LpCSb2 
LpCSb3 
LpCSb4 



CCACCGNTACTTGAACACCGAATCANTTAAATGTCATTTGGCATAAAGAGATTAGGACATi 

ccaccgStacttgHacaccgaatSanStaaatgSSatttggcataaagagattaggacat 

'cr'TXPrr.gTaPTTriAarArrRAATCASSTTRAATGT CATTTGGCATAAAGAGATTAGGACAT 



683 
606 
322 



1160 



LpCSbl 
IipCSb2 
XipCSb3 
LpCSb4 



ga.cacataagttttatgtgiscgj5tcgg 
gacacataagttttatgtgtcgctcgg 
gacacataagttttatgtgtcgctcgS 



^^^re 7 Nucleotide sequence of LpCSc 

* 20 * 40 * 60 

LpCSc : TClSrCCGTGGCCANAATNCCCCMJCATTCAAATACCGCCOT^ s 60 



* 80 * 100 * 120 
LpCSc : CTTCTTATTTCCACCCCAACCGCCCAACyVTGTGTCCTCCC^^ : 120 

* 140 * 160 * 180 
LpCSc : CAACGGCCATAGCAACGGOlCCAACGGCGCaUlTGGCTCC^ : 180 



* 200 * 220 * 240 

LpCSc : C^CGACCAGACyVGAACCCTCACCCTACACACAAGAGCCCATATGCACCTGT^ : 240 



* 260 * 280 * 300 

IipCSc : TTTGTCAAATGTCGGCCGCTTCTUJlGATTATCGAGAGCa^CATTAAGAGAGGGCGAGC^ : 300 



* 320 * 340 * 360 
LpCSc : CGCCAACGCCTACTTCGACCTTGAGGCTAAAATCATVGATCGCCAGAGCTCTCGACAAC^ : 360 

* 380 * 400 * 420 
LpCSC : TGGTGTTGACTACATTGAAGTTACCAGGCCTGCTGCCTCTGAGCAGTCAAGAA^^ : 420 

* 440 * 460 * 480 
LpCSc : CGAAGCCCTCTGCAAGCTCGGATTGAAAGCCAAGATCCTTACCCACGTACGATGCCACAT : 480 

* 500 * 520 * 540 
LpCSc : GGACGATGCCAGAATCGCTQTCGAGACTGGTCTTGACGGCCTCGATGTCGTCATTQGAAC : 540 

* 560 * 580 * 600 
LpCSc : CTCTGCGTACCTCCGCGAGCACAGCCATGGCAAGGACATGACATACATCAAAAACAC^^ : 600 



* 620 * 

LpCSc : GCTGGAGGTGATTGAGTTTGTCAAGAGCAAGGGAN 635 



^^^ure 8 Deduced amino acid sequence of LpCSc 

* 20 *•. 40 * 60 
LpCSc : XRGXNXPXFKYRPSATNPPTFIjFPPQPPNMCPPTBXTPATNGHSNGTNGANGSKEGFTGV : 60 

* 80 * 100 * 120 
LpCSc : TTRQNPHPTHKSPYAPVGDFI/SWGRFKIIESTIiREGEQFANAYFDLEAKIKIARAXiDNF : 120 



* 140 * 160 * 180 

LpCSc : GVDYIBVTSPAASEQSRraCEALCKLGIiKAKILTHVRCHMDDARIAV^ t 180 



* 200 * 

LpCSc : SAYXiREHSHGKDMTYXKNTALEVIEFVKSKG : 211 



^^^bire 9 Nucleotide sequence of LpCSd 

* 20 * 40 * 60 
LpCSd : GTGOTATGGCNCJ^CCAGNANTCCTNCGTNCTGG^ : 60 

* 80* * 100 * 120 
LpCSd : CAACGACCTCAGCGATCT^GGCCATCAAGGACTACCTGTGGTCCACCCT : 120 



* 140 * 160 * 180 

LpCSd : AGTCQTTCCCGGTTACGGACACGCCGTTCTCCGCAAGACCX3ACCCCCGCTACGTCTCCCA : 180 



* 200 * 220 * 240 

LpCSd : GCGCX3AGTTCGCCCAGAAGCACCTTCCCQACGACCCAATGTTCAAGCTCGTCAGTCAGGT ; 240 



* 260 * 280 * 300 
LpCSd : CTACAAGATCGCCCCTGGTGTTCTCACCGAGCACGGCAAGACCAAGAACCCCTACCCCAA : 3 00 

* 320 * 340 * 360 
LpCSd : CGTCGACGCCCACTCCGGTGTCCTCCTCCAGTACTACGGCCTCACTGAGCAGAACTACTA : 360 

* 380 * 400 * 420 
LpCSd : CACCGTTCTCTTCGGTGTATCCCGTGCQCTCGGTGTCCTTCCCCAGCTTATCATTGACCG r 420 



* 440 * 460 * 480 

LpCSd : TGCCGTCGGTGCCCCCATTGAGAGGCCCAAGTCTTTCAGCACTGAGGCTTACGCCAAGTT' : 4 80 



* 500 * 520 * 540 
LpCSd : GGTTGGTGCTAAGTTGTAT^CGCGTTACTGCAACGTGCTCTACAGCCAGGAGl^ : 540 

* 560 * 580 * 600 
LpCSd : GGAATTTGTTTAAGATTCAGAGATACCTTGTCCTGTGTAGAATTGa s 600 



* 620 * 

LpCSd : GAATGGGAGCGTTACGGCGCTACATCACTACATTTN : 636 




Igure 10 Deduced amino acid sequence of LpCSd 



* 20 * 40 ' * 60 
LpCSd : XYGXXXXPXXraPXXXXAIGNDliSDQMKDYLWSTOKAGQVWGYGHAA^ : 60 

* 80 * 100 I . * 120 
LpCSd : REFAQKHIiPDDPMFKIiVSQVYKIAPGVLTEHGKTKNPYPNVrJifflSGVIjLQYYGIiT^ : 120 

I 
I 

* 140 * 160 

LpCSd : TVLFQVSRMiGVLPQLIIDRAVGAPIERPKSFSTBAYAKLVGAKL : 165 



^^ure 11 Consensus contig nucleotide sequence of LpMDHa 

* 20 * 40 * 60 
LpMDHa : GGTTGGTTQCTXSGTATCACCATTCTGCCCTGTTCTCACAGGCAACT : 60 

* 80 * 100 * 120 
LpMDHa : ATTGTCTAGTGAAGACATCAAGGCTCTCACCAAGAGGACACAGGAGGGTGGGACAGAAGT : 120 

* 140 * 160 * . 180 
LpMDHa : TGTTGAGGCAAAGGCTGGAAAGGGATCTGCAAGCTTGTCCATGGCGTATGCT^ s 180 



* 200 * 220 * 240 

LpMDHa : TTTTGGTGATGCATGCTTGAAGGGTCTGAAOaGAGTTCCTO : 240 



* 260 * 280 * 300 

LpMDHa : CGTGCAATCAACTATCACAGAACTGCCATTCTTTGCCTCCAAGGTGAGGCTCGGGAAGAA : 300 



* 320 * 340 * 360 

LpMDHa : TGGAGTCGAGGAAGTGCTTGGTTTGGGTXiaiGCTGTCGGCCTTTGAGAAGGAAGG : 360 



* 380 * 400 * 420 

LpMDHa : AAGTCTCTU^GGGTGAGCTCaVAGTCTTCy^ATTGAa^^ ' 420 



★ 440 * 460 * 480 
LpMDHa : TTAATTAATTTTGO^GATTATAGCAAACCAGGTCTAGTTAAGGGGTCTGTTTTTG^^ : 480 

♦ 500 * 520 * 540 
LpMDHa : TTGTTCAGTGCTTTTTCTQCCCATCACGTGGGCATGQAAGATTTGAGC^ ' 540 



* 560 * • 580 * 600 
LpMDHa : ATCCGGCGGCGTAATGCCACAGAACATTACTTGTACa^GAGGGAACTAGTTaS : 600 

* 620 * 640 * 660 
LpMDHa : TTTTGAACTGGTACATTAAACGAACAATTGCTGATGCACTTTGAG^^ : 660 



* 680 * 

LpMDHa : GTGANTCCATTGGCCTCAAGCCAAAAAAA2\AAAAAA : 696 



^l^^ire 12 Deduced amino acid sequence of LpMDHa 

* 20 * 40 * 60 
LpMDHa : VGCTYHHSALFSQATPSTNALSSEDIKAI.TKRTQEGGTEVVEAKAGKGSATLS^^ : 60 

* 80 * 100 * 120 
LpMDHa : FGDACLKGLNGVPDIVECSYVQSTITELPPFASKVRLGKNGVBEVLGLGELSAFEKEGLE : 120 



* • 140 
LpMDHa : SLKGHLKSSXDKGXAFANAS : 140 



ns 



xe 13 Nucleotide sequences of nucleic acid fragments contributing to the 
Insensus contig sequence LpMDHa 



LpMDHal 
LpMDHa2 
LpMDHa3 
ripMDHa4 
LpMDHaS 
IipMDHa6 
lipMDHa? 



20 



40 



60 



G'^TGGTTGCTGGTATCACCATTCTGCCCTGTTCTCACAGGCAACTdCTTCGACTAATGC 
j^TGGTTGCTGGTATCACCATTCTGCCCTGTTCTCACAGGCAACTGCTTCGACTAATGCj 
Ic^GllTGCTGGTATCACCATTCTGCCCTGTTCTCACAGGCAACTqCTTCGACTAATGC 
"iGGTTGCTGGTATCACCATTCTGCCCTGTTCTCACAGGCAACTqCTTCGACTAATGC 



60 
59 
59 
56 



IipMDHal 

XjpMDHa2 

XipMDHaB 

LpMDHa4 

LpMDHaS ^ 

IipMDHa6 

IipMDHa? 



80 



100 



120 



ATTGTCTAGTGAAGACATCAAGGCTCTCACCAAGAGGACACAGGAGGGTGGGACAGAAGTl 
ATTGTCTAGTGAAGACATCAAGGCTCTCACCAAGAGGACACAGGAGGGTGGGACAGAAGT 
ATTGTCTAGTGAAGACATCAAGGCTCTCACCAAGAGGACACAGGAGGGTGGGACAGAAGT 
ATTGTCTAGTGAAGACATCAAGGCTCT CACCAAGAGGACACAG GAGGGTGGGACAGAAGT 

GAGGGTGGGACAGAAGT 



120 
119 
119 
11€ 
17 



140 



160 



180 



LpMDHal 
IipMDHa2 
LpMDHaa 
LpMDHa4 
LpMDHaS 
LpMDHa6 
LpMDHa? 



LpMDHal 
LpMDHa2 
LpMDHaS 
LpMDHa4 
LpMDHaS 
LpMDHa6 
LpMDHa? 



LpMDHal 
LpMDHa2 
LpMDHaS 
LpMDHa4 
LpMDHaS 
LpMDHa6 
LpMDHaT 



TGTTGAGGCAAAGGCTGGAAAGGGATCTGCAACCTTGTCCATGGCGTATGCTGGCGCAGTi 
TGTTGAGGCAAAGGCTGGAAAGGGATCTGCAACCTTGTCCATGGCGTATGCTGGCGCAGTi 
TGTTGAGGCAAAGGCTGGAAAGGGATCTGCAACCTTGTCCATGGCGTJATGCTGGCGCAGT 
TGTTGAGGCAAAGGCTGGAAAGGGATCTGCAACCTTGTCCATGGCGTATGCTGGCGCAGT 
TGTTGAGGCAAAGGCTGGAAAGGGATCTGCAACCTTGTCCATGGCGTJATGCTGGCGCAGT 



TTTTGGTGATGCATGCTTGAAGGGTCTGAACGGAGTTCCTGACATTGTTGAATGCTCCTA^ 
TTTTGGTGATGCATGCTTGAAGGGTCTGAACGGAGTTCCTGACATTGTTGAATGCTCCTA 
TTTTGGTGATGCATGCTTGAAGGGTCTGAACGGAGTTCCTGACATTGTTGAATGCTCCTA 
TTTTGGTGATGCATGCTTGAAGGGTCTGAACGGAGTTCCTGACATTGTTGAATGCTCCTA 
TTTTGGTGATGCATGCTTGAAGGGTCTGAACGGAGTTCCTGACATTGTTGAATGCTCCTA 







260 


HI 






m 



CGTGCAATCAACTATCACAGAACTGCCATTCTTTGCCTCCAAGGTGAGGCTCGGGAAGAA 

cgtgcaatcaactatcacagaactgccattctttgcctccaaggtgaggctcgggaagaa 
cgtgcaatcaactatcacagaactgccattctttgcctccaaggtgaggctcgggaagaa 
tjgtgcaatcaactatcacagaactgccattctttgcctccaaggtgaggctcgggaagaa 

^gtI^E.gctcggSIagaa 

AA 



180 
179 
179 
176 
77 



















200 




220 




240 





240 
239 
239 
236 
137 



300 
299 
299 
296 
197 
17 
2 



320 



340 



360 



LpMDHal 
LpMDHa2 
LpMDHaS 
LpMDHa4 
LpMDHaS 
LpMDHa 6 
LpMDHa7 



TGGAGTCGAGGAAGTGCTTGGTTTGGGTGAGCTGTCGGCCTTTGAGAAGGAAGGTTTGGA 
TGGAGTCGAGGAAGTGCTTGGTTTGGGTGAGCTGTCGGCCTTTGAGAAGGAAGGTTTGGA 
TGGAGTCGAGGAAGTGCTTGGTTTGGGTGAGCTGTCGGCCTTTGAGAAGGAAGGTTTGGA 

tggagtcgaggaagtgcttggtttgggtgagctgtcggcctttgagaaggaaggtttgga 
tggagtcgaggaagtgcttggtttgggtgagctgtcggcctttgaHaaggaaggtttgga 
tggagtcgaggaagtgcttggtttgggtgagctgtcggcctttgagaaggaaggtttgga 

TGGAGTCGAGGAAGTGCTTGGTTTGGGTGAGCTGTCGGCCTTTGAG^GGAAG GTTTGGA 



360 
359 
359 
356 
257 
77 
62 



MDHal 
LpMDHa2 
LpMDHa3 
LpMDHa4 
LpMDHaS 
LpMDHa6 
LpMDHa? 



380 



400 



420 



AJ^GTCTCAZVGGGTGAG^CTCAAGTCTTCAATTGACAAGGGCATCGCG'ljTCGCCAATGCGAG 
PLAGTCTCAAGGGTGAGCTCAAGTCTTCAATTGACAAGGGCATCGCGTTCGCCAATGCGAG 
AAGTCTCAAGGCTGAGCTCAAGTCTTCAATTGACAAGGGCATCGCGTITCGCCAATGCGAG 
;2VAGTCTCAAGGGTGAGCTCAAGTCTTCAATTGACAAGGGCATCGCG'i|tCGCCAATGCGAGi 
AAGTCTCAAGGGTGAGCTCAAGTCTTCAATTGACAAGGGCATCGCG11TCGCCAATGCGAG' 
kAGTCTCAAGGGTGAGCTCAAGTCTTCAATTGACAAGGGCATCGCGojTCGCCAATGCGAG 
AAGTCTCAAGGGTGAGMTCAAGTCTTCAATTGACAAGGGCATCGCGTjTCG CCAATGCGAG 



'420 
419 
419 
416 
317 
137 
122 



440 



460 



480 



ttaattaattttgcagattatagcaaaccaggtctagttaaggggtctg^HttgMttt 

TTAATTAATTTTGCAGATTATAGCAAACCAGGTCTAGTTAAGGGGTaTG^HTTGBxTT 

ttaattaattttgcagattatagcaaaccaggtctagttaaggggtatg^httgbttt 
ttaattaattttgcagattatagcaaaccaggtctagttaaggggtgtgbHttgH|tt 

;tt§attaas'tttgcagattatagcaa[i|ccaggtctagtt^aggggtctgtttttgacttt 

TTGATTAAiyXTTGCAGATTATAGCAASlcCAGGTCTAGTT'GAGGGGTCT 
'TT^GATTAAyTTTGCAGATTATAGCAAllCCAGGTCTAGTT^AGGGGTGTGTTTTTGACTTT 



475 
474 
474 
471 
377 
197 
182 



LpMDHal 
LpMDHa2 
LpMDHaS 
LpMDHa4 
LpMDHaS 
LpMDHa6 
LpMDHa7 



500 



520 



540 



TTGTTCAGTGCTTTTTCTGCCCATCACGTGGGCATGGAAGATTTGAGCTTCACAATAAAA 

TTGTTCAGTGCTTTTTCTGCCCATCACGTGGGCATGGAAGATTTGAGCTTCACAATAAAA 

TTGTTCAGTGCTTTTTCTGCCCATCACGTGGGCATGGAAGATTTGAGCTTCACAATAAJ^ 

TTGOTCASSpCTTTTTCTGCCCATCACGTGNGCATGSAAGATTTGAGCTTm 

TTGTTCAG&GCTTTTTCTGCCCATCACGTGGGCATGGAAGATTTGAGCTTC^ 

TTGTTCAGTGCTTTTTCTGCCCATCACGTGGGCATGGAAGATTTGAGCTTCACAATAAAA 

TTGTTCAGTGCTTTTTCTGCCCATCACGTGGGCATGGAAGATTTGAQCTTCACAATAAAA 



LpMDHal 
LpMDHa2 
LpMDHaS 
.LpMDHa4 
LpMDHaS 
LpMDHa6 
LpMDHa7 



560 



580 



600 



ATCCGGCGGCGTAATGCCACAGAACATTACTTGTACAAGAGGGAACTjAGTTCGTGTCAAG 
ATCCGGCGGCGTAATGCCACAGAACATTACTTGTACAAGAGGGAACIAGTTCGTGTCAAG 
ATCCGGCGGCGT AATGCCACAGAACATTACTTGTACAAGAGGGAACT[AGTTCGTGTCAAG 

ATNCiCNpCGCGNflBBBBBSB 

ATCcSS'cGGCGTAATGCCACANAACATTACTTGgACAAGAGGGAAC^ 

ATCCGGCGGCGTAATGCCACAGAJVCATTACTTGTACAAGAGGGAACIjAGTTCGTGTCAAG 
ATCCGGCGGCGTAATGCCACAGAACATTACTTGTACAAGAGGGAACT|AGTTCGTGTCAAG 



* 620 * 640 * 660 

LpMDHal 
LpMDHa2 
LpMDHaS 
LpMDHa4 
LpMDHaS 
LpMDHa6 
LpMDHa7 



TTTTGAACTGGTACATTAAACGAACAATTGCTGATGCACTTTGAGATijAAAAAAAjf 
TTTTGAACTGGTACATTAAACGAACAATTGCTGATGCACTTTGAGAAlAAAAAAAp 
TTTTGAACTGGTACATTAAACGAACAATTGCTGATGCACTTTGAGAAjAAAAAAM 



TTTTGAACTGGNACATTAAAC[iffOpCAA 

TTTTGAACTGCTACATTAAACGAACAATTGiliTGATGCACTTTG 
TTTTGAACTGGTACATTAAACGAACAATTGlliTGA^^"^^^'?^^^^ 



LpMDHal 
LpMDHa2 
LpMDHaS 
LpMDHa4 
LpMDHaS 
LpMDHa6 
LpMDHa7 



680 



GTGANTCCATTGGNCTKfAAGCCWAAAAAAAAA 

XiTGATiTCCATTGlcjBTCAAGiPlAAg^ 



589 
413 



H^ure 14 Consensus contig nucleotide sequence of LpMDHb 

* 20 * 40 * 60 
LpMDHb : TTTGGTNCTTTTGCCGAGCGAGAAAGCTGTTCGGTGTCA^^ • 60 

* 80 * 100 * 120 
LpMDHb : TAAAACTTTCTACGCTGGGAAGGCAAACGTGCCAGTCACTGGGGTGAATC : 120 



* 140 * 160 * 180 
lipMDHb : TOGTGGCCATGCTGGTGTTACTATCCTGCCACAGTTCTOVCAGGCTACT : IBO 

it ^ 200 * 220 * 240 

LpMDHb ; TGCATTOTCCa^TGAGGACCTTAAGGCCCTCACCAAQAGGAa^CAAGATTC^ : 240 

* 260 * 280 * 300 
LpMDHb : AGTTGTTGAAGCAAAGQCTGGAAAGGGCTCAGOUVCATTGTCAATGGCATATGCTGGTGC : 300 



* 320 * 340 * 360 
LpMDHb : AGTATTTGGAGATGC^TOCTTGAAGGGGCTCAATGGAGTTCC^^ : 360 

* 380 * 400 • * 420 
LpMDHb : CTTTGTGCAATCAACCGTAACAGAGCTGCCATIOT^ - 420 

* 440 * 460 * 480 
LpMDHb : GAACGGAGTGGAGGAAGTGATTGGGCTGGGCGAGCTGTCTGCCTTCGAGAAGGAGGGTCT z 480 

* 500 * 520 * 540 
LpMDHb : GGAGAGCCTCAAQGGCGAGCTQimSNCCTCCATCGAGAAGGGTATCAAG^ : 540 



* 560 * 580 * 600 

LpMDHb : GAGCTAGTCAACCTGCTCAGATTCTAAGACTCCGOVCATQAACTOTGTGGGA 600 

. * 620 * 640 * 660 

LpMDHb : ATTTTTGGTACGACTCCTTT<y^CTGCCCCCTTCTCCTGGGGACATTGAGGCGTCGNGCTC : 660 



* 680 * 700 * 720 
LpMDHb : aVCAATAAAATGGCGTGNCnTOTTGCCMACTQAACT^ 720 

* 740 * 760 * 780 
LpMDHb : GAAACCCTOTGCCTTATGTACa^OlGTACGGTQAACCCGAAAATCATX^ : 780 



* 800 
LpMDHb : GATTCTGTGGAAGCTTTTTTCTTTTAN : 807 



^^^^re 15 Deduced amino acid sequence of LpMDHb 

* 20 * 40 * 60 

LpMDHb : IiXIiIiPSEKAVRCHHPXVVRAKTPYAGKANVPW : 60 



* 80 * 100 * 120 

LpMDHb : AIiSHEDLKALTKRTQDGGTEWEAKAGKGSATLSMAYAGAVFGDACLKGLNGVPDIVECS : 120 



* 140 * 160 * 180 

XipMDHb : FVQSTVTBLPFFASK\mLGK(7GVEEVIGLGELSAFEKEGLESLKGEiaXSIEKGIKFA : 180 



LpMDHb : S : 181 



■ure 16 Nucleotide sequences of nucleic ac:id fragments contributing to the 
»nsensus contig sequence LpMDHb 



20 



40 



60 



LpMDHbl : 
LpMDHb2 : 



TTTGGTNCTXTTG^CGAGBgAS|AAgCTGTTCGGTGTCACCACCCTTGNGTTGTTCGTGCT 
GCGAGAiAGCTGTTgGGTGTCACCACCCTOfGjllGTTGTTCGTGCT 



60 
44 



LpMDHbl 
LpMDHb2 



80 



100 



120 



CTTTCTACGCTGGGAAGGCAAACGTGCCAGTCACTGGGGTGAATGTTCCTGTTGTTG 

aaaactttctacgctgggaaggcaaacgtgccjggtcactggggtga^Itgttcctgttgttg 



121 
105 



IipMDHbl 
LpMDHb2 



140 



160 



180 



gtggccatgctggtgttactatcctgccac|ggttctcacaggctact?cctgcaagtaatgc 
gtggccatgctggtgttactatcctgccacagttctcacaggctactcctgcaagtaatgc 



LpMDHbl 
LpMDHb2 



200 



220 



240 



ATTGTCCCATGAGGAgCTTAAGGCCCTCACCAAGAGGACACAAGATdGTGGGACGGAAGTT 

attgtcccatgaggaccttaaggccctcaccaagaggacacaagatggtgggacggaagtt 



243 
227 



260 



280 



300 



LpMDHbl : 
LpMDHb2 : 



GTTGAAGCAAAGGCTGGAAAGGGCTCAGCAACATTGTCAATGGCATPlTGCTGGTGCAGTAT 
GTTGAAGCAAAGGCTGGAAAGGGCTCAGCAACATTGTCiATGGCATiiTGCTGGTGCAGTffiT 



304 
288 



LpMDHbl 
LpMDHb2 



320 



340 



360 



TTGGAGATGCATGCTTGAAGGGGCTCAATGGAGTTCCTGACATTGTAGAGTGCTCCTTTGT 
TTGGAGATGCATGCTTGAAGGGGCTCAATGGAGTTCCTGACATTGTAGAGTGCTCCTTTGT 



365 
349 



380 



400 



420 



LpMDHbl : 
LpMDHb2 : 



gcaatcaac||gtaacagagctgccattctttgcctccaaggtaaggatcggcaagaacgga 
gcaatcaaccgtaacagagctgccattctttgcctccaaggtaaggcItcggcaagaacgga 



426 
410 



440 



460 



480 



LpMDHbl : 
LpMDHb2 : 



gtggaggaagtgattgggctgggcgagctgtctgccttcgagaaggagggtctggagagcc 
gtggaggaagtgattgggctgggcgagctgtctgccttcgagaagg^Igggtctggagagcc 



: 487 
: 471" 



LpMDHbl 
LpMDHb2 



500 



520 



540 



tcaagggcgagctgntgmcctccatcgagaagggtatcaagttcgcgcaggagagctagtc 

TCAAGGGCGAGCTGfflTGfflcCTCCAT[SGAGAAGGGTATCAAGTTCGCil]cAGGAGAGCTAGTC 



548 
532 



560 



580 



600 



LpMDHbl : 
LpMDHb2 : 



AACCTGCTCAGATTCT.GACACTCCGigACATGAACTCGGTGGGATCTGATGAATTTTTGGTA 
AACCTGCTCAGATTCTAACACTCCGCACATGAACTCGGTGGGATCTGATGAATTTTTGGTir 



609 
593 



LpMDHbl 
LpMDHb2 



620 



640 



660 



CGACTCCTTTCgCTGCCCQ3TT:|TC'gTGGGGACATTGAGGCGTgGlNrG|CTiSCACA|^^^ 
CGACTCCTTTCACTGCCCCCTTCTCCTGGGGACATTGAGGCGTCG'feCTCCACAATAA?^ 



670 
654 



LpMDHbl 
LpMDHb2 



680 



700 



720 



GGCGTGl^TTGTTGHCATACTGAI^.CTGAgCTTgTAjiT^gL _____ 
GGCGTG5iCTTGTTGCCATACTGAACTGAACTTGTAATACCAGAAAGAiGTGAAACCCTGTGC 



708 
715 



740 * * 760 ♦ 780 * 

MDHbl : : 

LpMDHb2 : [4iihJAJcliikW<a4JAili^^ : 776 



LpMDHbl : 
LpMDHb2 ; 



800 



C TTTTTTCTTTTAN 



790 



^^^^gure' 17 N:ucleotide sequence o£ LpMDHc 

* 20 * 40 * 60 
LpMDHc : GNITGGTNTACCGAGCGCNCATACTTTNGTGGGTGAGGTTC^ : 60 

* 80 * 100 * 120 
IipMDHc : GTCy^ATGTTCCTGTNGNTGGCGGGCATGCCGGAGTTACMATATTGCCACTCCT^ : 120 

* 140 ♦ 160 * 180 
LpMDHc : GTTAATCCTCCCTGCTCa^TTCACCATCSAGGAAATTAGTATCTCACCTTCACAGCATACAG : 180 

* • 200 * 220 * 240 
LpMDHc : AATGGTGGGACaiGAAGTNGTCGAGGCGAAAGCTGGAGCAGGATCGGNNACTOT : 240 



* 260 * 280 * 300 
LpMDHc : GCGNATGCGGCAGCTAAATTTGCAGATGCTTGCTNGAGAGGATTGCATGGTGATGCTGGG- : 3 00 

* 320 * 340 * 360 
LpMDHc : ATAGNGGANTGCTCTTATGTGGATTCTCAGGTGACGGANCTSrrCXTTKr^ : 360 



* 380 * 400 * 

LpMDHc ! GTTCGCCTTGQTTGTTCTGGCQTCSMAGGAGATCTTGCCACro s 419 



^^^^re 18 Deduced amino acid sequence of LpMDHc 

* 20 * 40 * * 60 
LpMDHc : XGXPSXHTXVGBVLGXDPRDVNVPXXGGHAGVXILPLIiSQVNPPCSFTMRKLVSm : 60 

* 80 * 100 * 120 
LpMDHc : NGGTEXVEAKAGAGSXTXSMAXAAAKFADACXRGLHGDAGIXXCSYVDSQVTXXSXFASK : 120 



LpMDHc : VRLGCSGVXEILPLGPLNE : 139 



^^iure 19 Nucleotiae sequence o£ LpMDHd 



* - • 20 * 40 * 60 

LpMDHd : GITGNTTCCGCCAACACAACACCACCGCTCCCCCGTCCGCATCTCTCCCTTTCGCCTCCAT : 60 



* 80 * * 100 * 120 

LpMDHd : CGATCCAGATCCCACACACCGCCGCAGCCAGCAACGATGAGGCCGTCGGC6ATGAGATCC : 120 



* 140 * 160 * 180 

LpMDHd : GCCGCGCAGCTCCTCCGCCGCCGCAGCTACTCGTCCGCGTCCGGCCAGCCGGAGCGGAAG : 180 



* 200 * 220 * 240 

LpMDHd : GTGGCCATCCTCGGCGCGGCCGGCGGGATCGGGCAGCCGCTGGCGCTCCTCATGAAGCTG : 240 



* 260 * 280 * 300 

LpMDHd : AACCCGCTCGTCTCCTCCCTCTCCCTCTACGACATCGCCGCCACCCCCGGCGTCGCCQCC : 300 



* 320 * 340 * 360 

LpMDHd ! GACGTCTCCCACATCAACTCCCCGGCCCTGGTGAAGGGGTTCATGGGCaACGATCAGCTC : 360 



* 380 ♦ 400 * 420 

LpMDHd : GCGQAGGCGTTGGAGGGGGCCGACCTCGTCATCATCCCGGCCGQCGTTCCGAGGAAGCCC : 420 



* 440 * 460 * 480 

LpMDHd : GGO^TGACCAGGGACGATCTCTTCaUVCaiTCAACGCCGGCATCGTTAAG^ : 480 



* 500 * 520 * 540 

LpMDHd : GCCATCGCaU\GTACnX3CCCCaUVCGCTCTTATaUVCATC : 540 



* 560 * 580 * 600 

LpMDHd : ACTGTTCCAATTGCTGCTGAAGTTTTCa^GAAGGCTGGAACCTATGATGAGAAGAAGTTG : 600 



* 620 * 640 * 660 

LpMDHd : TTTGGTGTCACCACTCTTGATGTTGTTCGTGCO^GGACTTTCTATGCTGGGAAGGCT : 660 



* 680 * 700 * 720 

LpMDHd : GTACCroTTACTGGTGTGAAOSTTCCTGTTGTTGGTGGTCATGCTGGT^ : 720 



* 740 * • 760 * 

LpMDHd : CCa^CTGTTCTCACyVGGCyyVCTCCTTCGACTAATGCATTGTCTAGTG/^ : 774 



^l^^re 20 Deduced amino acid sequence of IipMDHd 



* 20 * 40 * 60 

IipMDHd z XXPPTQHHRSPVRISPFIaJHRSRSHTPPQPATMRPSA^mSAAQLLRI«lSYSSASGQP : 60 



* • 80 * 100 * 120 

LpMDHd : VAILGAAGGIGQPIiALLMKIjWPLVSSLSIjYDIAATPQVAADVSHINSPALV^ : 120 



* 140 * 160 * 180 

LpMDHd *: AB^^EGADLVIIPAGVPRKPGMTMDDLFNINAGIVKNLCTAIAKYCPK : 180 



* 200 • * 220 * 240 

LpMDHd : TVPIAAEVFKKAGTYDEKKLFGVTTLDVYRARTFYAGKANVPVTGVl^ : 240 



LpMDHd : PLFSQATPSTNALSSEDX : 258 



^^^ure 21 Nucleotide sequence of LpMDHe 

* 20 * ' 40 * 60 

LpMDHe : TCCGTACNATTGCTGCTGAAGTATTTAAAA7UVGCTGGGACATACAATCCTAAGA<^ s 60 



* 80 • * 100 * 120 

LpMDHe : TGGGGGTGACyVACACTTGATGTAGTGAGAGCCAATACTITTGTGGGTGAGGTTCTTGGAC : 120 



* 140 * 160 * 180 

LpMDHe : TTGACCCCAGAGATGTCS^TGTTCCTGTTQTTGGCXSGGCATGCCGGAGTTACG^ : 180 



* 200 * 220 * 240 
LpMDHe : CACTCCTTTCGCAGGTTAGTCCTCCCTGCTCGTTCaVCCCCTGAGGAAATT^ : 240 

* 260 * 280 * 300 
LpMDHe : CCTCACGCATACAGAATGGTGGGACAGAAGTTGTGGAGGCQAAAGCAGGAGCAGGATCGQ : 300 



* 320 * 340 * 360 

LpMDHe : CAACTCTTTCTATGGCGTATGCGGCAGCTAAATTTGC^^ : 360 



* 380 * 400 * 420 

LpMDHe : ATGGTGATGCTGGGATAGTGGAGTGCTCTTATGTGGATTCTCAGGTOACCGGJ^ : 420 



* 440 * 460 * 480 
LpMDHe : TCTTTGCATCCAAAGTTCGCCTAGGTCGTTCTGGCGTCGAGGAGATCTTGCAACTTGGGT : 480 

* 500 ' * 520 * 540 
LpMDHe : CCACTOAACCaUSGTTTTGAAAGANCIKSGACTGGAANAAGGCGAA;^^ : 540 

* 560 * 580 * 600 
LpMDHe : AQAGCCTTCCaVGAAAGGimSTGTCATTTCGTNCAACAAAGTGAGra : 600 



* 620 * 640 * 660 
LpMDHe : TTGTTGGATGTGCTTCCCCAAAGTTCC^U^Ca^CACCGTCGNAATTGGCATATANAT^ : 660 

* 680 * 700 * 720 
LpMDHe : TGGTTTGGGGCCTTTTGCan^TNATGCaUUVCAQC^ ! 720 



* 740 * 760 * 780 

LpMDHe : NTGAAAAACTCrTAACATTTTTlTTTACGGTT^ s 780 



* 800 * 

LpMDHe : ANTATATGATAAimaAANAAAGTTl^AAAAAAAAN : 816 



re 22 Deduced amino acid sequence of LpMDHe 

* 20 * 40 * 60 

LpMDHe : RXIAAEWKKAGTYNPKRLLGVTTLDVVRAOTPVGEVM : 60 



* 80 * 100 * 120 

LpMDHe : LLSQVSPPCSFTPEEXSYLTSRIQNGGTEVVEAKAGAGSATLSiyiAYiU^AKFiVDACIiRG^ : 120 



* 140 * 160 * 180 

LpMDHe : GDAGIVECSYVDSQVTGTAFPASKVRLGRSGVEEILQLGSTEPGFBRXGLEXGEXXSYPE : 180 



* 200 * 220 * 240 

LpMDHe : SLPERXCHFXQQSELHAIIFVGCASPKFQHTVXIGIXILLVWGLLXXCKQATXWVGGVRX : 240 



* 260 * 

LpMDHe : EKLLTFPFTVXNKXXEKPEXYMIXBXSXXKK : 271 



xe 23 Consensus contig nucleotide sequence of LpMDHf 

* . 20 * 40 * 60 
LpMDHf : GGGATGATTNAO^CAACAAAAATGCTGGGCATTGTCCGATG^ATCTOTGAGQGCGl^ : 60 

* 80 * 100 * 120 
LpMDHf : AAGAGCTGTCCTAATGOU^TAGTGAATTTGATCAGCAACCCTGTGAACTCAACTGTCCCC : 120 

* 140 * 160 * 180 
LpMDHf : ATTGCGGCT^GTAGNTTTCAAGAGOGCTGGAACTTACTGCCCCAAAOT i 180 

* 200 * 220 * 240 
LpMDHf : ACAACTCTTGATGTAGCGAGGGCTAACACCTTTGTGGCTGAAGTGC^ : 240 

* 260 * . 280 * 300 
LpMDHf : AGAGAAGNCAGTGTTCCGGOTGTTGGCGGGCATGCAGGGATCACTATATTGCCCCTCCTG : 300 

* 320 * 340 * 360 
LpMDHf : NCCCAGGTCAGCCCCCCGTGCTCATTCACTCCAGATGAAATaVGCTAT^^ : 360 

* 380 * 400 * 420 
LpMDHf s ATACAGAATGGCGGTACCGAAGTTGTTGAGGCAAAGGCTGGAGCAGGCTOTGC^ : 420 

* 440 * 460 * 480 
LpMDHf : TCAATGGCTTTTGCTGCTGCAAAATTCGCCGATGCATGCTTGCGTGGAATGCGTC : 480 

* 500 * 520 * 540 
LpMDHf : GCTGGCATTGTGGAATGTNCATAOSTTGCATCTGAGGTGACAGAGCT^ : 540 

* 560 * 580 * 600 
LpMDHf : ACAAAAGTGAGGTTAGGTCGTGGCGGAGCTGAGGAGATCCTCCCTCTTGGGCC^ : 600 

* 620 * 640 * 660 
LpMDHf : GACTTTGAGAGAGC^GCCTGGAGAAGGCGAANAAGGAGCTCAGCGAGAGCATCCAGAAG : 660 

* 680 * 700 * 720- 
LpMDHf : 6GTGTGGCGTTCATGAACA?VGTGAGATCATATGAATGGATGGATACCCCGC^ : 720 

* 740 * 760 * 780 
LpMDHf : ATAGATGATGCTUUVGACTAAAGAAAGAGTOTGATATAGTGCTCCTATATA^^ ; 780 

LpMDHf : CTCTCCTGCCTGTAAGAA : 798 



^^^ure 24 Deduced amino* acid sequence of LpMDHf 

* 20 * 40 * ' 60 
LpMDHf : MIiGIVRSICEGVAKSCPNAIVlSn:iISNPVNSTVPIAAEXPKI^ : 60 

* 80 * 100 * 120 
LpMDHf 2 ANTPVZVEVLGXDPREXSVPXVGGHAGITILPLLXQVSPPCSFTPDEISYLTNRIQNGGTE ; 120 

* . 140 * 160 * 180 
LpMDHf : VVEAKAGAGSATLSMAFAAAKFADACniRGMRGDAGrVECXWASEVTELPFF^^ : 180 

* 200 * 220 
LpMDHf : GGAEEIIiPLGPLMDFERAGLEKAXKELSESXQKGVAFMNK : 220 



^Biire 25 Nucleotide sequences of .nucleic acid fragments contributing to the 
^^msensus contig sequence LpMOHf 

* 20. * 15 * 

IipMDHf2 : " ftfgrAiiMef±^jf^f^f|Vi|aft|4|liW£yiy£TA^^^ 

* . 80 * 100 * 120 

■ ■■ I B M A \ _LJ t . \ .LR M ,J^ t ^iM.l^^^^ ^ : 120 

LpMDHf2 : yAVAWAX«#iiMeyif*tWATiWtLef»»iT£%if^ciiW,T,vw 

* 140 * 160 * 

I Li ii i i III I M l I l _l I I L I UjLlif f Jiilii^ l Jii f r ^ ^ : 180 

LpMDHfl : . 
IipMDHf2 : fgBBBB5SggrATcf*TA\otMMiywAy£^^^^ 

* 200 * 220 * 240 

,LjL mii i mM j ! i^ m j ifM ^ : 240 

LpMDHfl : gjjepijys^l^^ . 238 

* 260 * 280 * 300 

LpMDHfl : ^gggggg|^Bg|g[|g^^ . 
LpMDHf2 : ^^^^^^*fi«*ra««iwwi««ww«^fiiS^^ 

* 320 * 340 * 360 

LpMDHfl : gaaBBSggg^ggg^^ . 

* 380 * 400 * 420 

LpMDHfl : yyggglgj^^ , 
LpMDHf2 : KMmarAX«i»*iiii««««wifAW««ATAiwiiiitcMy^^^ tfl11'1^MTm¥T-i i 

* 440 * 460 * 480 

I ■ ■ ■ 1 I I i -ijj j. i .j,iJ.j^i^ i .J^i^Lt.l,XXJ.JJ^JjaXJ>l«^AJJJ.»L J>^WeiiJ^^^ : 480 

LpMDHfl : ijlSSSISaSSSK^^ . 478 

LpMDHf2 : |iir«™iMCtctfBMMiiiKCt€iiitm«iwM«r«rATA^Iil^^^ r« mmFWin Tl ft T1 

* 500 * S20 * 540 

.1 ^ \ .j.J^lM.AM^l^J^MLJJ.J^M^J.^M ^ : 540 

LpMDHfl : MgHgaa^ . 497. 

LpMDHf2 : MMKMCiaMiMitciiiLCicr^iWimiiiai - 

* 560 * 580 * 600 

LpMDHfl : kSJI^mAMm^Tt^^MWSSLifif^mmmummim 

LpMDHf2 : " 

* 620 * 640 * 

LpMDHfl : «f£tMiMiiK€m«^WAT«aHij«^w«™«i»™feTOgc«P^^ n ■! i 

LpMDHf2 : • ' 

680 700 * 720 

LpMDHfl : hiMiirAiWi!f rf giiiliJ«M^ 

LpMDHf2 




740 



760 



780 



IjpMDHf2 : 



rAAAGAAAGAGTGTGATATAGTGCTCCTATlATACCTGTAAAft^ 



LpMDHfl : 
LpMDHf2 : 



CTCTCCTGCCTGTAAGAA 



798 



^^^^re 26 Nucleotide sequence o£ LpMDHg 

* 20 * 40 * 60 

LpMDHg : CAATTGCACGTTCTTGCTCACTTO^GCATCACCCTCACGCTTCrCC^^ : 60 



* 80 * 100 * 120 

IjpMDHg : CAACCGTCACTATGGTCAAGGCTGTCGTCGCAGGTGCTGCTGGTGGTATCGGGCAGCCCC : 120 



* 140 * 160 * 180 

LpMDHg : TCTCTCTTCTACTCS^GACGAGCCCCCTCATCGATGAGCTTGCCCTCTACGATGTTGTCA : 180 



* 200 * 220 * 240 

LpMDHg : ACACTCCCGGTGTTGCCGCTGATCTTTCCaVCATCTCATCCCGOSCTCAAATCGCC^ : 240 



* 260 * 280 * 300 
LpMDHg : ACCTCCCCAAGGATGATGGCGCAAAGGCTGCATTCAAAGATGCCGACATTATCGTCATCC : 3 00 

* 320 * 340- * ' 360 
LpMDHg : CCaCCGGCaVTTCCTCGa^GCCTGGCATGACCCGTGATGACCTCTTC^ J 360 



* 380 * 400 * 420 

LpMDHg : GAATTOTGAAGGGTCTGATTGAGGTTGCCQCCGAAGTTGCCCCCAA^^ : 420 



* 440 * 460 * 480 

LpMDHg : TCATCTCCAACCCTCTO^CTCTACCGTCCCTATCTCTGCCGAGGTCCTCAAGGCCAAGG : 480 



* 500 * 520 * 540 

LpMDHg : GCGTCTTCAACCCTCAGCGTCTTTTCGGTGTGACCACCCTaaAC^ : 540 



* 560 * 580 * 600 

LpMDHg : CTTTCGTCGCCAGCATCACCGGOSAGAAGCAGCCCCaVGA^ J 600 



* 620 * 640 * 660 

LpMDHg : GCGGCCACTCCGGCGAGACCATCGTCCCGCTTTTCAGCAAGGNTCaVGCCCTCTGCTTNCA : 660 



LpMDHg : TTCCCGC : 667 



^^^^re 27 Deduced amino acid sequence o£ LpMDHg 

* 20 * 40 * 60 

LpMDHg : IARSCSLQHHPHASPTQPLPT\mWKAWAGAAGGIGQPLSLIiI^ : 60 



* 80 * 100 * 120 

LpMDHg : TPGVAADLSHISSiy^QIAGYLPKDDGAKAAFKDJUDIIVIPAGIPRKPGMTRDDLPNINAG : 120 



* 140 * 160 * 180 

LpMDHg : IVKGLIEVIU^APKAPILVISNPWSWPISAEVLKAKGVFNPQRLFGV^ : 180 



* 200 * 220 

LpMDHg : FVASITGEKQPQNLTVPVIGGHSGETXVPLFSKXQPSAXIP : 221 



^^^^Lre 28 Consensus contig nucleotide sequence of LpMDHh . 

* 20 * 40 * 60 

LpMDHh : TNACGGAGCTGCTTAAATCAGCCCCCATTCCGCCTCGTCTATAGCGATCCTTC^^ : 60 



* 80 * 100 * 120 

LpMDHh : TGTCGTCGCCTCCTCCCGAACCACTCTCCCCATCCCCGAACTCCAGAACCGGCTCCAATG : 120 



* 140 * 160 * 180 

LpMDHh : GCGGCGAAGGAACCGATGCGCGTGCTCGTCACCGGCGCCGCAGGACAAATTGGATATGCT : 180 



* 200 * 220 * 240 

LpMDHh : CTTGTTCCGATGATTQCTAGGGGAATTATGCTTGGTGCGGACCAGCCTGra : 240 



* 260 * 280 * 300 

LpMDHh : ATGCTGGATATTCCACCAGCTGCTGAAGCTCTTAATGGTGTTAAGATGGAGTTGGTTGAT : .300 



* 320 * 340 * 360 

LpMDHh : GCCGCy^TTTCCACITCTCAAGGGAGTTGTTGCAAa^ : 360 



* 380 * 400 * 420 

LpMDHh : GGTGTGAATGTTGCGGTTATGGTTGGTGGATTCCCCAGGAAGGAGGGAATGGAAAGGAAG : 420 



* 440 * 460 * 480 

LpMDHh : GATGTTATGTCTAAGAATGTTTCAATCTACAAATCTCAAGCATCTGCCCTTGAAGCCCAT : 480 



* 500 * 520 * 540 

LpMDHh : QCAGCCCCGAATTGGAAGGTTCTGGTTGTTGCCAATCCAGOUUVCACC^ : 540 



* 560 * 580 * 600 

LpMDHh : TTiUU^GGAGTTTGCTCCATCTATTCCTGAGAAGAACATa^TO : 600 



* 620 * 640 * 660 

LpMDHh : CATAACAGGGCACTTGGTCAGATCTCTGAGAGACTTGATGTCCAAGTTAGXGATGTGAAG : 660 



* 680 * 700 * 720 

LpMDHh : AATCTTATCATCTGGGGCaUVTCACTCnrrCCAGTCAGTACCCTGATGTC^ : 720 



* 740 * . 760 * 780 

LpMDHh : GTGAAGACTTCCAGTGGOSAGAAGCCTGTTCGaSAACTT^^ : 780 



- * 800 * 820 * 840 

LpMDHh : AATGCAGGGTTCATTGCCACTGTCCAGCAGCGTGGTGCTGCAATCATCAAAGCGAGGAAG : 840 



* 860 * 880 *• 900 

LpMDHh : CrCTCCAGTQCTCTCTCTGCTGCCAGCTCTGCTTGTGACCA(»TCCGTGATTGGG : 900 



* 92 0 * 940 * 960 

LpMDHh : GGiUVCCCCTGAGGGAAOVTTTGTTTCaVTGGGTGTGTATTCTGATGGTTCATACGGTGTG : 960 



W * 980 * 1000 * 1020 

ImDHIi : CCTGCTGGGCTTATCTACTCCTTCCCaVGTAACTTGCTGCGGTGGTGAATGGACAAT^ : 1020 



* 1040 * 1060 * 1080 

LpMDHh : aUVGGGCrCCaSATCGACGAGTTCTaVAGAAAGAAGATGGATGCa^^^ : 1080 



* 1100 * 1120 * 1140 

LpMDHh : TCGGAQGAGAAGGCTCTCGCCTACTCGTGCCTOSAGTAACTGCATACCAGGGAGCAGCTC : 1140 



* * 1160 * 1180 * 1200 

IjpMDHh : CCGCTCTGATGTTTTGAATAAAAGGAACATTTTGGCTCCATGAAACTCATCTCCACTCAG : 1200 



* 1220 * 1240 * 1260 

LpMDHh ; AACAGTTGCACaTCGCGGTGCCTTTAGCTGGTTTTTCCS^TGTGTATG^ : 1260 



* 1280 * 1300 * 1320 

. LpMDHh : GTAGCTCTATTTTCGCCri^TGATTTAC^ : 1320 



* 1340 * 1360 * 1380 

LpMDHh : TTTGACGTCTGATTAAAACCAACCTCTTATTATTCCTGTGTGTATGAATGAGGCrr^^ : 1380 



* 1400 * 1420 * 1440 

LpMDHh : AGCTCTATTTTOaCCTGATQATTTACAGGCCT^TGATATTGGCAGGAG^ r 1440 



* 1460 * 1480 

LpMDHh : TGACGCCTGATTAAAACO^CCTCTTATTACTAAAAAA^^ : 1484 



^^t^re 29 Deduced amino acid sequence of LpMDHh 

* 20 * 40 * 60 • 
LpMDHh : MAAKEPMRVLVTG;VAGQIGYAI.VPMIARGIMIiGMQPVIIiH^ : 60 

* 80 * 100 * 120 
LpMDHh : DAAFPLLKGVVATTDWEACTGVljnrAVMVGGFPRKEGMERKD^ : 120 

* 140 * 160 * . 180 
LpMDHh : HJ^NCKVLVVANPAimiALILKBFAPSIPBKNISCIiT^ : 180 

* 200 * 220 * 240 
LpMDHh : KCrVIIWGiraSSSQYPDViraATVKTSSGEKPVRELVKDD^ : 240 

* 260 * 280 * 300 
LpMDHh : KLSSALSAASSACDHIRDWVLGTPEGTFVSMGVYSDGSYGVPAGLIYSFPVTCCGGEWTI : 300 



* 320 * 

LpMDHh : VQGLPIDEPSRKKMDATAQELSEEKALAYSCLE : 333 



lure 30 Nucleotide sequences of nucleic acid fragments contributing to the 
insensus contig sequence LpMDHli 



LpMDHhl 

LpMDHh2 

LpMDHh3 

XipMDHh4 

IjpMDHhS 

LpMDHhG 

LpMDHh? 

LpMDHhB 

LpMDHhd 

LpMDHhl 0 

IipMDHhll 

LpMDHhl2 

IipMDHhlS 

LpMDHhl 4 

LpMDHhl 5 

LpMDHhl 6 

LpMDHhl? 

LpMDHhl 8 

LpMDHhl 9 

LpMDHh20 

LpMDHh21 

LpMDHh22 

LpMDHh23 

LpMDHh24 

LpMDHh25 

LpMDHh26 

LpMDHh27 

LpMDHh28 

LpMDHh29 

LpMDHh3 0 

LpMDHhB 1 

LpMDHh32 

LpMDHhB 4 

LpMDHhB 5 

LpMDHhB 6 

LpMDHhB? 

LpMDHh3a 

LpMDHhB 9 

LpMDHh40 

LpMDHh41 

LpMDHh42 

LpMDHh43 

LpMDHh44 

LpMDHh45 

LpMDHh46 

LpMDHh47 

LpMDHh48 

LpMDHh49 

LpMDHhB 0 

LpMDHhSl 

LpMDHh52 

LpMDHhSB 

LpMDHh54 

LpMDHhSS 

LpMDHhB 6 

LpMDHhB? 

LpMDHh58 

LpMDHhB 9 

LpMDHh60 

LcM}Hh61 

LpMDHh62 

LpMDHh63 

LpMDHh64 



TiSTACGGAGC 



20 




lipMDHha 

IjpMDHh3 

XipMDHh4 

IipMDHhS 

LpMDHhe 

IjpMDHh7 

LpMDHhS 

IjpMDHh9 

IipMDHhlO 

LpMDHhll 

I,pMDHhl2 

LpMDHhl3 

LpMDHhl4 

LpMDHhlJS 

IipMDHhl6 

LpMDHhl? 

lipMDHhlS 

I,pMDHhl9 

IjpMDHb20 

XipMDHh21 

LpMDHh22 

LpMDHh23 

LpMDHh24 

IipMDHh25 

I.pMDHh26 

LpMDH)l27 

IipMl)Hh28 

IjpMDHh29 

LpMDHlxBO 

LpMDHhS 1 

LpMDHh32 

XjpMDHh34 

IipMDHh35 

I«pMDHh36 

IipMDHh37 

LpMDHh38 

LpMDHhS 9 

LpMDHh40 

LpMDHh41 

LpMDHh42 

LpMDHh43 

LpMDHh44 

LpMDHh45 

LpMDmi46 

IipMDHh47 

LpMDHh48 

LpMDHh49 

LpMDHhS 0 

LpMDHhS 1 

LpMDHhS 2 

LpMDHhS 3 

LpMDHh54 

LpMDHhS 5 

LpMDHhS 6 

LpMDHhS 7 

LpMDHhS 8 

LpMDHhS 9 

LpMDHhe 0 

LpMDHhe 1 

LpMDHh62 

LpMDHhe 3 

LpMDHhe4 



100 



120 



TCGTCGCCTCCTCCCGAACCACTCTCCCCATCCCCGAACTCCAGA^CCGGCTCCAATGGCGG 
TlGBcil^CTicicSSBAACCACTCTCCCCAiCCCCGAACTCCAGAi^CCGGCTCCAATGGCGG 
ilgScS^CTCCTC^^CCACTCTCCCCATCCCClAACTCCAGAACCGGCTCCAATGGCGG 
iG%CGCCTCCTCCCGBASSACN,CTCCCCATCCCCGAACTCCAGA;^CCGGCTCCAATGGCGG 
TCGTCGCCTCCTCCCGSAr£ACTCTaCCCi|CCCCGAACTCCAGA^CCGGCTCCAATGGCGG 
TCGTCGCCTCCTCCCGAicCACTCTCCCCqTCCCCGAACTCCAGAJj.CCGGCTCCAATGGCGG 
TCGBc^CTCCTCCcBGACCACTCTCCCCATCCCCGAACTCCAGAJj.CCGGCTCCAATGGCGG 
TCGTCGCCTCCTCCCGM^CCACTCTCCCCATCCCCGAACTCCAGAJ=vCCGGCTCCAATGGCGG 
TCGTCGCCTCCTCCCGAACCACTCTCCCCATCCCCGAACTCCAGAijLCCGGCTCCAATGGCGG 

tcgtcgcctcctcccgaaccactctccccBtccccgaactccaga^ccggctccaatggcgg 

TCGTCicCTCCTCCCGAACCACTCTCCCCATCCCCGAACTCCAG7U=.CCGGCTCCAATGGCGG 

tcgtcgcctcctcccgHaccactctccccatccccgaactccagpjj.ccggctccaatggcgg 
tcgtcgcctcctcccgIaccactctccccatccccgaactccaga^ccggctccaatggcgg 
tcgtcgcctcctcccgSaccactctccccBtccccgaactccaga^ccggctccaatggcgg 

TCGTCGCCTCCTCCCGiACCACTCTCCCCATCCCCGAACTCCAGA;f.CCGGCTCCAATGGCGG 
TCGTCGCCTCCTCCCGlACCACTCTCCCCATCCCCGAACTCCAGAiJ.CCGGCTCCAATGGCGG 

tcgtcgcctcctcccgIaccactctccccatccccgaactccaga^ccggctccaatggcgg 
tcgtcgcctcctcccgIaccactctccccatccccgaactccaga^ccggctccaatggcgg 

TCGTCGCCTCCTCCCGBACCACTCTCCCCATCCCCGAACTCCAGAi!.CCGGCTCCAATGGCGG 
TCGTCGCCTCCTCCCGHACCACTCTCCCCATCCCCGAACTCCAGAiiLCCGGCTCCAATGGCGG 

tcgtcgcctcctcccgIaccactctccccatccccgaactccaga^ccggctccaatggcgg 
tcgtcgcctcctcccgIaccactctccccatccccgaactccaga^ccggctccaatggcgg 

TCGTCGCCTCCTCCCG|ACCACTCTCCCCATCCCCGAACTCCAGAi|.CCGGCTCCAATGGCGG 
TCGTCGWCT'i5cTCCCGlACCACTCTCCCCATCCCCGAACTCCAGA;j.CCGGCTCCAATGGCGG 
TCGTCGCCTCCTCCCGAACCACTCTCCCcBTCCCCGAACTCCAGAijLCCGGCTCCAATGGCGG 
TCGTCGCCTCCTCCCGAACcBCTCTCCCCATCCCCGAACTCCAGAiJ.CCGGCTCCAATGGCGG 
TCGTCGCCTCCTCCCGAACCACTCTCCCCATCCCCGAACTCCAGA^CCGGCTCCAATGGCGG 

tcgtcgcctcctcccgBaccactctccccatccccgaactccagajJlCcggctccaatggcgg 

TCGTCGCCTCCTCCCGfACCACTCTCCCCATCCCCGAACTCCAGAJJ.CCGGCTCCAATGGCGG 
TCGTCGCCTCCTCCCGlACCACTCTCCCCATCCCCGAACTCCAGAi}.CCGGCTCCAATGGCGG 

tcgtcgcctcctcccgIaccactctccccatccccgaactccagaaccggctccaatggcgg 

TCGTCGCCTCCTCCCGAACCACTCTCCCCATCCCCGAACTCCAGAJj.CCGGCTCCAATGGCGG 

tcgtcgcctcctcccgBaccactctccccatccccgaactccaga^ccggctccaacggcgg 

TCGTCGCCTCCTCCCGAACCACTCTCCCCfEiSjCCCCGAACTCCAGAiiLCCGGCTCCAATGGCGG 
TCGTCGCCTCCTCCCGAACCACTCTCCCC;TO:cCCCGAACTCCABAa.CCGGCTCCAATGGCGG 
TCGTCGCCTCCTCCCGAACCACTCTCCCcilMCCCCGAACTCCAGAACCGGCTCCAATGGCGG 
TCGTCGCCTCCTCCCGBA®ciCTCTCCCCATCCCCGAACTCCAGA7|.CCGGCTCCAATGGCGG 

ItcgtngcctSctcccgaaccactctcccc|tccccgaactccagaaccggctccaatggcgg 
tcgtcgcctcctcccgBaHc^ctctccccatccccgaactccaga^^ccggctccaatggcgg 
tcgtcgcctcctcccgaacIBctctccccatccccgaactccagaaccggctccaatggcgg 
tcgtcgcctcctcccgaaccIctctccccatccccgaactccBgaaccggctccaatggcgg 
jTCGTCGCCTCCTCCCGBACCACTCTCCCCATCCCCGAACTCCAGA^iCCGGCTCCAATGGCGG 
tcgtcgcctcctcccgIaccactctccccatccccgaactccagaaccggctccaatggcgg 
tcgtcgcctcctcccgIaccactctccccBtccccgaactccagaaccggctccaatggcgg 
tcgtcgcctcctcccgaaccactctccccBtccccgaactccagaaccggctccaatggcgg 



Itcgtcgcctcctcccgaaccactctccccatccccgaactccagaaccggctccaatggcgg 

iHaSEHBB^HG'SACCBCTCTCCCCATCCCCGAACTCCAGNA^ 

WBWWWWBBaiaiM^^MHigcCCCGlAMTCCAB^^ACCGGCTCCAABGGCGG 



140 



160 



180 



IpMDHhl 
LpMDHh2 
IipMDHh3 
XipMDHh4 
LpMDHhS 
LpMDHhe 
. LpMDHh? 
IipMDHhe 
IipMDHhS 
IipMDHhlO 
IjpMDHhll 
rjpMDHhl2 
LpMDHhlS 
Iip^4DHhl4 
IipMDHhlS 
IipMDHhl6 
LpMDHhl? 
LpMDHhlS 
LpMDHhl 9 
IipMDHh20 
IipMDHh21 
IipMDHh22 
LpMDHh23 
LpMDHh24 
LpMDHh25 
IipMDHh26 
IipMDHh27 
IipMDHh28 
I.pMDHh29 
IjpMDHh30 
LpMDHhSl 
. I.pMDHh32 
IipMDHh34 
' IipMDHh35 
XipMDHh36 
IjpMDHh37 
LpMDHh38 
X<pMDHil39 
IipMDHh40 
IipMDHh41 
IipMDHh42 
IipMDHh43 
LpMDHh44 
Ijp]yiDHh45 
IipMDHh46 
IipMDHh47 
LpMDHh48 
IipMDHh49 
lipMDHhSO 
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CGAAGGAACCGATGCGCGTGCTCGTCACCGGCGCCGCAGGACAAATTGGATATGCTCTTGTT 
CGAAGGAACCGATGCGCGTGCTCGTCACCGGCGCCGffiAGGACAAATTGGATATGCTCTTGTT 
CGAAGGAACCGATGCGCGTGCTCGTCACCGGCGCCGCAGGACAAATTGGATATGCTCTTGTT 
CGAAGGAACCGATGCGCGTGCTCGTCACCGGCGCCGCAGGACAAATTGGATATGCTCTTGTT 
CGAAGWAACCGATGCGCGTGCTCGTCACCGGCGCCGCAGGACAAATTGGATATGCTCTTGTT 
CGAAGGAACCGATGCGCGTGCTCGTCACCGGCGCCGCAGGACAAATTGGATATGCTCTTGTT 
CGAAGGAACCGATGCGCGTGCTCGTCACCGGCGCCGCAGGACAAATTGGATATGCTCTTGTT 
CGAAGGAACCGATGCGCGTGCTCGTCACCGGCGCCGCAGGACAAATTGGATATGCTCTTGTT 
CGAAGGAACCGATGCGCGTGCTCGTCACCGGCGCCGCAGGACAAATTGGATATGCTCTTGTT 
CGAAGGAACCGATGCGCGTGCTCGTCACCGGCGCCGCAGGACAAATTGGATATGCTCTTGTT 
CG7UVGGAACCGATGCGCGTGCTCGTCACCGGCGCCGCAGGACAAATTGGATATGCTCTTGTT 
CGAAGGAACCGATGCGCGTGCTCGTCACCGGCGCCGCAGGACAAATTGGATATGCTCTTGTT 
CGAAGGAACCGATGCGCGTGCTCGTCACCGGCGCCGCAGGACAAATTGGATATGCTCTTGTT 
CGAAGGAACCGATGCGCGTGCTCGTCACCGGCGCCGCAGGACAAATTGGATATGCTCTTGTT 
CGAAGGAACCGATGCGCGTGCTCGTCACCGGCGCCGCAGGACAAATTGGATATGCTCTTGTT 
CGAAGGAACCGATGCGCGTGCTCGTCACCGGCGCCGCAGGACAAATTGGATATGCTCTTGTT 
CGAAGGAACCGATGCGCGTGCTCGTCACCGGCGCCGCAGGACAAATTGGATATGCTCTTGTT 
CGAAGGAACCGATGCGCGTGCTCGTCACCGGCGCCGCAGGACAAATTGGATATGCTCTTGTT 
CGAAGGAACCGATGCGCGTGCTCG.TCACCGGCGCCGCAGGACAAATTGGATATGCTCTTGTT 
CGAAGGAACCGATGCGCGTGCTCGTCACCGGCGCCGCAGGACAAATTGGATATGCTCTTGTT 
CGAAGGAACCGATGCGCGTGCTCGTCACCGGCGCCGCAGGACAAATTGGATATGCTCTTGTT 

cgaaggaaccgatgcgcgtgctcgtcaccggcgccgcaggacaaattggatatgctcttgtt 
c^aaggaaccgatgcgcgtgctcgtcaccggcgccgcaggacaaattggatatgctcttgtt 
Sgaaggaaccgatgcgcgtgctcgtcaccggcgccgcaggacaaattggatatgctcttgtt 
cgaaggaaccgatgcgcgtgctcgtcaccggcgccgcaggacaaattggatatgctcttgtt 
cgaaggaaccgatgcgcgtgctcgtcaccggcgccgcaggacaaattggatatgctcttgtt 
cgaaggaaccgatgcgcgtgctcgtcaccggcgccgcaggacaaattggatatgctcttgtt 
cgaaggaaccgatgcgcgtgctcgtcaccggcgccgcaggacaaattggatatgctcttgtt 
cgaaggaaccgatgcgcgtgctcgtcaccggcgccgcaggacaaattggatatgctcttgtt 
cgaaggaaccgatgcgcgtgctcgtcaccggcgccgcaggacaaattggatatgctcttgtt 
cgaaggaaccgatgcgcgtgctcgtcaccggcgccgcaggacaaattggatatgctcttgtt 
cgaaggaaccgatgcgcgtgctcgtcaccggcgccgcaggacanattggatatgctcttgtt 
cSaaggaaccgatgcgcgtgctcgtcaccggcgccgcaggacaaattggatatgctcttgtt 
cgaaggaaccgatgcgcgtgctcgtcaccggcgccgcaggacaaattggatatgctcttgtt 
cgaaggaaccgatgcgcgtgctcgtcaccggcgccgcaggacaaattggatatgctcttgtt 
cgaaggaaccgatgcgcgtgctcgtcaccggcgccgcaggacaaattggatatgctcttgtt 
cgaaggaaccgatgcgcgtgctcgtcaccggcgccgcaggacaaattggatatgctcttgtt 
cgaaggaaccgatgcgcgtgctcgtcaccggcgccgcaggacaaattggatatgctcttgtt 
cgaaggaaccgatgcgcgtgctcgtcaccggcgccgcaggacaaattggatatgctcttgtt 
cgaaggaaccgatgcgcgtgctcgtcaccggcgccgcaggacaaattggatatgctcttgtt 
cgaaggaaccgatgcgcgtgctcgtcaccggcgccgcaggacaaattggatatgctcttgtt 
cgaaggaaccgatgcgcgtgctcgtcaccggcgccgcaggacaaattggatatgctcttgtt 
cgaaggaaccgatgcgcgtgctcgtcaccggcgccgcaggacaaattggatatgctcttgtt 
cgaaggaaccgatgcgcgtgctcgtcaccggcgccgcaggacaaattggatatgctcttgtt 
cgaaggaaccgatgcgcgtgctcgtcaccggcgccgcaggacaaattggatatgctcttgtt 
cgaaggaaccgatgcgcgtgctcgtcaccggcgccgcaggacaaattggatatgctcttgtt 
cgaaggaaccgatgcgcgtgctcgtcaccggcgccgcaggacaaattggatatgctcttgtt 
cgaaggaaccgatgcgcgtgctcgtcaccggcgccgcaggacaaattggatatgctcttgtt 
cgaaggaaccgatgcgcgtgctcgtcaccggcgccgcaggacaaattggatatgctcttgtt 
cgaag naaccgaBgcgcgtgctcgtcaccggcgccgcaggacaaattggatatgctcttgtt 
HBBBgSaccgatgcgcgtgctcgtcaccggcgccgcaggwcaaattggatatgctcttgtt 



IipMDHh2 
LpMDHh3 
LpMDHh4 
LpMDHhS 
LpMDHhe 
LpMDHh? 
LpMDHha 
lipMDHhS 
LpMDHhlO 
LpMDHhll 
IipMDHhl2 
IipMDHhl3 
IipMDHhl4 
LpMDHhlS 
LpMDHhie 
LpMDHhl? 
LpMDHhie 
LpMDHhl 9 
LpMDHh20 
LpMDHh2 1 
IipMDHh22 
LpMDHh23 
LpMDHh24 
.LpjMDHh25 
•LpMDHh26 
LpMDHh27 
LpMDHh28 
IipMDHh29 
' LpMDHhS 0 
LpMDHhS 1 
.LpMDHhS 2 ; 
LpMDHh34 : 
•LpMDHhS 5 J 
LpMDHhS 6 : 
LpMDHhS? ; 
LpMDHhS 8 : 
LpMDHhS 9 : 
LpMDHh40 : 
LpMDHh41 : 
LpMDHh42 : 
LpMDHh43 : 
LpMDHh44 : 
LpMDHh45 : 
. LpMDHh46 : 
.LpMDHh47 : 
LpMDHh48 : 
LpMDHh49 ; 
LpMDHhS 0 : 
LpMDHhS X : 
LpMDHhS 2 : 
LpMDHhS 3 : 
LpMDHhS4 : 
LpMDHhSS : 
LpMDHhS 6 : 
LpMDHhS? : 
LpMDHhSS : 
LpMDHhS 9 : 
LpMDHheO : 
LpMDHh61 : 
LpMDHh62 : 
LpMDHheS : 
LpMDHh64 : 



CCGATGATTGCTAGGGGAAT^ 

CCGATGATTGCTAGGGGAATTATGCTTGGTGCGGACCAGCCTGTTATTCT.GCATATGCTGGA 
CCGATGATTGCTAGGGGAATTATGCTTGGTGCGGACCAGCCTGTTATTCTGCATATGCTGGA 
CCGATGATTGCTAGG|l|AATTATGCTTGGiG|GiAC^AGCCTGTTATTlTGCATATSBBS 
CCGATGATTGCTAGGGGAATTATGCTTGGTGgGGACCAGCCTGTTATTCTGCATATGCTGGA 
CCGATGATTGCTAGGGGAATTATGCTTGGTGCGGACCAGCCTGTTATTCTGCATATGCTGGA 
CCGATGATTGCTAGGGGAATTATGCTTGGTGCGGACCAGCCTGTTATTCTGCATATGCTGGA 

ccgatgattgctaggggaattatgcttggtgcggaccagcctgttAttctgcatatgctgga 

CCGATGATTGCTAGGGGAATTATGCTTGGTGCGGACCAGCCTGTTATTCTGCATATGCTGGA 

ccgatgattgctaggggaattatgcttggtgcggaccagcctgttattctgcatatgctgga 
ccgatgattgctaggggaattatgcttggtgcggaccagcctgttIttctgcatatgctgga 
ccgatgattgctaggggaattatgcttggtgcggaccagcctgttattctgcatatgctgga 
ccgatgattgctaggggaattatgcttggtgcggaccagcctgttattctgcatatgctgga 
ccgatgattgctaggggaattatgcttggtgcggaccagcctgttattctgcatatgctgga 
ccgatgattgctaggggaattatgcttggtgcggaccagcctgttattctgcatatgctgga 
ccgatgattgctaggggaattatgcttggtgcggaccagcctgttIttctgcatatgctgga 



ccgatgattgctaggggaattatgcttggtgcggaccagcctgttattctgcatatgctgga 
ccgatgattgctaggggaa-ttatgcttggtgcggaccagcctgttattctgcatatgctgga 
ccgatgattgctaggggaattatgcttggtgcggaccagcctgttAttctgcatatgctgga 
ccgatgattgctaggggaattatgcttggtgcggaccagcctgttattctgcatatgctgga 
ccgatgattgctaggggaattatgcttggtgcggaccagcctgttattctgcatatgctgga 



ccgatgattgctangggaattatgcttggtgcggaccagcctgttattctgcatatgctgga 
ccgatgattgctaggggaattatgcttggtgcggaccagcctgttattctgcatatgctgga 
ccgatgattgctaggggaattatgcttggtgcggaccagcctgttattctgcatatgctgga 
ccgatgattgctaggggaattatgcttggtgcggaccagcctgttattctgcatatgctgga 
ccgatgattgctaggggaattatgcttggtgcggaccagcctgttattctgcatatgctgga 
ccgatgattgctaggggaattatgcttggtgcggaccagcctgttattctgcatatgctgga 
ccgatgattgctaggggaattatgcttggtgcggaccagcctgttattctgcatatgctgga 
ccgatgattgctaggggaattatgcttggtgcggaccagcctgttattctgcatatgctgga 
ccgatgattgctaggggaattatgcttggtgcggaccagcctgttattctgcatatgctgga 
ccgatgattgctaggggaattatgcttggtgcggaccagcctgttattctgcatatgctgga 
ccgatgattgctaggggaattatgcttggtgcggaccagcctgttattctgcatatgctgga 
ccgatgattgctaggggaattatgcttggtgcggaccagcctgttattctgcatatgctgga 
ccgatgattgctaggggaattatgcttggtgcggaccagcctgttattctgcatatgctgga 
ccgatgattgctaggggaattatgcttggtgcggaccagcctgttattctgcatatgctgga 

CCGATGATTGCTAGGGGAATTATGCTgGGTGCGGACCAGCCTGTTATTCTGCATATGCTGGA 

ccgatgattgcta^igggaattatgcttggtgcggaccaIcctgttattctgcatatgctgga 
ccgatgattgctaggggaattatgcttggtgcggaccagcctgttattctgcatatgca:gga 
ccgatgattgctaggggaattatgcttggtgcggaccagcc|gttattctgcatatgctgga 
ccgatgattgctaggggaattatgcttggtgcggaccagcctgttattctgcatatgctgga 
ccgatgattgctaggggaattatgcttggtgcggaccagcctgttattctgcatatgctgga 
ccgatgattgctaggggaattatgcttggtgcggaccagcctgttattctgcatatgctgga 
ccgatgattgctaggggaattatgcttggtgcggaccagcctgttattctgcatatgctgga 
ccgatgattgctaggggaattatgcttggtgcggaccagcctgttattctgcatatgctgga 
ccgatgattgctaggggaattatgcttggtgcggaccagcctgttattctgcatatgctgga 

CCGATGATTGCTAGGGGAATTATGCTTGGTGCGGACCAGCCTGTTA.TTCTGCATATGCTGGA 

ccgatgattgctaggggaattatgcttggtgcggaccagcctgttattctgcatatgctgga 

CCGATGATTGCTAGGGGAATTATGCTTGGTGCGGACCAGCCTGTTATTCTGCATATGCTGGA 

CCGATGATTGCTAGGGGAATTATGCTTGGTGCGGACCAGCCTGTT^iTTCTGCATATGCTGGA 
CCGATGATTGCTAGGGG^ 

^^^^^^^^BSI^^^BtatgcttggtgcggHccagcctgtt4.ttctgcatatgctgga 



246 
212 
210 
202 
206 
204 
202 
20S 
205 
203 
204 
202 
201 
202 
202 
202 
201 
202 
202 
201 
202 
202 
201 
201 
201 
200 
201 
199 
199 
200 
199 
200 
199 
199 
198 
199 
198 
198 
197 
197 
196 
197 
196 
195 
196 
193 
189 
192 
170 
153 
119 
41 
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LpMDHh26 

LpMDHh27 

LpMDHh28 

LpMDHha 9 

LpMDHh30 

LpMDHh31 

LpMDHh32. 

LpMDHh34 

LpMDHh35 

LpMDHh36 

LpMDHh37 

LpMDHh38 

LpMDHhS 9 

LpMDHh40 

LpMDHh41 

LpMDHh42 

LpMDHh43 

LpMDHh44 

LpMDHh45 : 
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LpMDHh47 ! 

LpMDHh48 : 
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LpMDHhS 0 : 
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LpMDHh64 : 



TATTCCACCAGCTGCTGAAGCTCTTAATGGTGTTAAGATGGAGTTGbTTGATGCCGiATSS 
TATT^CACCAGCTGCTGAAGCTCTTAATGG'- - "-i^'^t^^^^^AGCCGWATTTN 



.gGTTAAS3ATGGA|lTGilTBWGGCgGCMT.gc?^il 



TAT.CCACCAGCTGCTGAAGCTCXT^^^^^^^ 

TATTCCACCAGCTGCTGAAGCTCTTAATGGTGTTAAGATGGAGTTGbTTGATGCCGcJ^S? 

™^^^'''=^^^^°^^^^°^^c^'^^t°°t°ttaagatggagttgSS?g^ 

^^^^^<=^^<=C^«^PCTGAAGCTCTTAATGGTGTTAAGATGGAGTTGbTTGAiGcSSSTC 

tattccaccagc;ggctgaagctcttaatggtgttaagatggagttg'gttStc?cgcmttc 

J™^^^^^°^t°ctgaagctcttaatggtgttaagatggagttg'gt?Stc?cgS^^^ 

tattccaccagctgctgaagctcttaatggtgttaagatggagttg'gttgatgcScaJS? 

TATTCCACCAGCTGCTGAAGCTCTTAATGGTGTTAAGATGGAGTTdGTxSTGcScSSr 

™'^^^*'^°=^^'=^^°^°^^^^™°<^t<^ttaagatggagSS?™ 
tattccaccagctgctgaagctcttaatggtgttaagatggagttg'gttga?gcSSStc 

TATTCCACCAGCTGCTGAAGCTCTTAATGGTGTTAAGATGGAGTTG'GTTSTGCCGCASJr 

tattccaccagctgctgaagctcttaatggtgttaagatggagtt3S?S?S?gS?Sc 

tattccaccagctgctgaagctcttaatggtgttaagatggagttg'gttgatgcScaS^c 

tattccaccagctgctgaagctcttaatggtgttaagatggagttSSSS^SSS^^ 

S^^^^^''^^''^^^^^°^°=^<=^taatggtgttaagatggagttg'gSgS?gSScS^ 

JJ^^^^^'^^^^^^^^^^^^SCtcttaatggtgttaa.gatggagttg'gttStScSSStc 

TATTCCACCAGCTGCTGAAWCTCTTAATGGTGTTAAGATGGAGTTG'GTTGATGCcSciJ^^r 
TATTCCACCAGCTGCTGAAGCTCTTAATGGTGTTAAGATGGAGTTG'GTTGATGCciciS^ 

tattccaccagctgctgaagctcttaatggtgttaagatggagttSttgatcccgSJSp 

TATTCCACCAGCTGCTGAAGCTCTTAATGGTGTTAAGATGGAGSdSSSc^ScMSc 

tattccaccagctgctgaagctcttaatggtgttaagatggagSSSSSSgSS^c 

TATTCCACCAGCTGCTGAAGCTCTTAATGGTGTTAAGATGGAGTTdOTTSSScCGSS?^ 

J^I^^^'^'^^^^^^^^t^^gctcttaatggtgttaagatggagttSSSScSc^^^^ 

I^^!^^'''''=^°^^°^^^^°^T<=^TT^TGGTGTTAAGATGGAGTTdGTTSTCS2cA?Sc 

tattccaccagctgctgaagctcttaatggtgttaagatggagttggttgatgSgS?5c 

TATTCCACCAGCTGCTGAAGCTCTTAATGGTGTTAAGATGGAGTTdGTTGA?GcScAS^r 
TATTCCACCAGCTGCTGAAGCTCTTA.z.TGGTGTTAAGATGGAGTiS™ScCGciSTC 
TATTCCACCAGCTGCTGAAGCTCTTAATGGTGTTAAGATGGAGTTGGTTGATCCCGSSJr 

tattccaccagctgctgaagctcttaatggtgttaagatggaSJSSSSccgcatSc 

™^^^^^^^^^'^'^^°^°^t^ttaatggtgttaagatggagttSSSScS 

tattccaccagctgctgaagctcttaatggtgttaagatggagttggttgatgcScatS? 

™^^^''''^^^°^^^^^^^°^t^tt^j.tggtgttaagatggagttSSSS?cgca?Sc 

tattccaccagctgctgaagctcttaatggtgttaagatggagttgStSScS^Stc 

™^^*^^'''''^'^°^t°^°^^^tt^tggtgttaagatggagttggttgatgccgcatttc 
™^^^^^^«^^«^t°^gctcttaatggtgttaagatggagtSgJtS?gSgSStc 

TATTCCACCAGCTGCTGAAGCTCTTAATGGTGTTAAGATGGAGTTdSTGSGCCGcS?Sc 

tattccaccagctgctgaagctcttaatggtgttaagatggagtJSSSS?cg?™^ 

I^^^^^'''^^^°^^°^^°^'^CTCTTAATGGTGTTAAGATGGAGTTdSTGA?GcSSS?c 
TATTCCACCAGCTGCTGAAGCTCTTAATGGTGTTAAGATGGAGTTciGTTGATGSGSJS 

J^^^^^^^^^°"°ctgaagctcttaatggtgttaagatggagttSSgJSc^ 

TATTCCACCAGCTGCTGAAGCTCTTAATGGTGTTAAGATGGAGTxdGTTGATSScSSc 

^^^^'^''^''^^^^''^^^^^^^^^^Q^^ttaagatggagttSSStcSSS^ 



t SOS 
: 274 
: 271 

: 26S 
: 266 
: 264 
: 267 
: 267 
: 265 
: 266 
: 264 
: 263 
: 264 
: 264 
: 264 
; 263 
; 264 
264 
263 
264 
264 
263 
263 
263 
262 
263 
261 
261 
262 
261 
262 
261 
261 
260 
261 
260 
260 
259 
259 
258 
259 
258 
257 
256 
255 
251 
254 
232 
215 
181 
103 
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LpMDHh36 
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LpMDHh41 
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LpMDHh43 
LpMDHh44 
LpMDHh45 
LpMDHh46 
..LpMDHh47 
LpMDHh48 
LpMDHh49 
LpMDHhS 0 
LpMDHhS 1 
LpMDHhS2 
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LpMDHhS 4 
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LpMDHh63 
LpMDHh64 




CACTTCTCAAGGGAGTTGTTGCAACAACTGATGTTGTTGAGGCTTGbACTGGTGTGAATGTT 

cacttctcaagggagttgttgcaacaactgatgttgttgaggcttg'cactggtgtgaatgtt 
gacttctcaagggagttgttgcaacaactga§gttgttgaggcttg'cactggtgtgaatgtt 
cacttctcaagggagttgttgcaacaactgatgttgttgaggcttg|cactggtgtgaatgtt 
cacttctcaagggagttgttgcaacaactgatgttgttgaggcttgcactggtgtgaatgtt 
cacttctcaagggagttgttgcaacaactgatgttgttgaggcttg'cactggtgtgaatgtt 
cacttctcaagggagttgttgcaacaactgatgttgttgaggcttg'cactggtgtgaatgtt 
cacttctcaagggagttgttgcaacaactgatgttgttgaggcttgcactggtgtgaatgtt 
cacttctcaagggagttgttgcaacaactgatgttgttgaggcttgcactggtgtgaatgtt 
cacttctcaagggagttgttgcaacaactgatgttgttgaggcttgcactggtgtgaatgtt 
cacttctcaagggagttgttgcaacaactgatgttgttgaggcttg'cactggtgtgaatgtt 
cacttctcaagggagttgttgcaac7u\ctgatgttgttgaggcttg'cactggtgtgaatgtt 
cacttctcaagggagttgttgcaacaactgatgttgttgaggcttgbactggtgtgaatgtt 
cacttctcaagggagttgttgcgacaactgatgttgttgaggcttg'cactggtgtgaatgtt 
cacttctcaagggagttgttgcaacaactgatgttgttgaggcttg'cactggtgtgaatgtt 
cacttctcaagggagttgttgcaacaactgatgttgttgaggcttg'cactggtgtgaatgtt 
cacttctcaagggagttgttgcaacaactgatgttgttgaggcttg'cactggtgtgaatgtt 
cacttctcaagggagttgttgcaacaactgatgttgttgaggcttgcactggtgtgaatgtt 

CACTTCTCAAGGGAGTTGTTGCAACAACTGATGTTGTTgAGGCTTGbACTGGB^HBSBB 

CACTTNTCAAGGGAGTTGTTGCAACAACTGATGTTGTTGAGGCTTGbACTGGTGTGAATGTT 
CACTTCTCAAGGGAGTTGTTGCAACAACTGATGTTGTTGAGGCTTGbACTGGTGTGAATGTT 
CACTTCTCAAGGGAGTTGTTGCAACAACTGATGTTGTTGAGGCTTGCACTGGTGTGAATGTT 
CACTTCTCAAGGGAGTTGTTGCAACAACTGATGTTGTTGAGGCTTGbACTGGTGTGAATGTT 
CACTTCTCAAGGGAGTTGTTGCAACAACTGATGTTGTTGAGGCTTGCACTGGTGTGAATGTT 
CACTTCTCAAGGGAGTTGTTGCAACAJ\CTGATGTTGTTGAGGCTTGCACTGGTGTGAATGTT 
CACTTCTCAAGGGAGTTGTTGCAACAACTGATGTTGTTGAGGCTTGCACTGGTGTGAATGTT 
CACTTCTCAAGGGAGTTGTTGCAACAACTGATGTTGTTGAGGCTTGCACTGGTGTGAATGTT 
CACTTCTCAAGGGAGTTGTTGCAACAACTGATGTTGTTGAGGCTTGCACTGGTGTGAATGTT 
CACTTCTCAAGGGAGTTGTTGCAACpACTGATGTTGTTGAGGCTTGCACTGGTGTGAATGTT 
CACTTCTCAAGGGAGTTGTTGCAACAACTGATGTTGTTGAGGCTTGCACTGGTGTGAATGTT 
CACTTCTCAAGGGAGTTGTTGCAACAACTGATGTTGTTGAGGCTTGCACTGGTGTGAATGTT 

cacttctcaagggagttgttgcaacaactgatgttgttgaggcttgcactggtgtgaatg;gt 

CACTTCTCAAGGGAGTTGTTGCAACAACTGATGTTGTTGAGGCTTGCACTGGTGTGAATGTT 

CACTTCTCAAGGGAGHTG|f;TGCAACAJ\CTGATGTTGN;TGAGGCTH^ 

CACTTWTCAAGGGAGTTGHrTGCAACAACTGATGTWGTTGASjGCTO^ 

CACTTCTCAAGGGAGTTGTTGCAACAACTGATGTTGTTGAGGCTTGCACTGGTGTGAATGTT 
CACTTCTCAAGGGAGTTGTTGCAACAACTGATGTTGTTGAGGCTTGCACTGGTGTGAATGTT 
CACTTCTCAJ^GGGAGTTGTTGCAACAACTGATGTTGTTGAGGCTTGCACTGGTGTGAATGTT 
CACTTCTCAAGGGAGTTGTTGCAACAACTGATGTTGTTGAGGCTTGCACTGGTGTGAATGTT 
CACTTCTCAJ\GGGAGTTGTTGCAACAACTGATGTTGTTGAGGCTTGCACTGGTGTGAATGTT 
CACTTCTCAAGGGAGTTGTTGCAACAACTGATGTTGTTGAGGCTTGCACTGGTGTGAATGTT 
CACTTCTCAAGGGAGTTGTTGCAACAACTGATGTTGTTGAGGCTTGCACTGGTGTGAATGTT 
CACTTCTCAAGGGAGTTGTTGCAACAACTGATGTTGTTGAGGCTTGCACTGGTGTGAATGTT 
CACTTCTCAAGGGAGTTGTTGCAACAACTGATGTTGTTGAGGCTTGCACTGGTGTGAATGTT 
CACTTCTCAAGGGAGTTGTTGCAACAACTGATGTTGTTGAGGCTTGCACTGGTGTGAATGTT 
CACTTCTCAAGGGAGTTGTTGCAACAACTGATGTTGTTGAGGCTTGCACTGGTGTGAATGTT 
CACTTCTCAAGGGAGTTGTTGCAACAACTGATGTTGTTGAGGCTTGCACTGGTGTGAATGTT 
CACTTCTCAAGGGAGTTGTTGCAACAACTGATGTTGTTGAGGCTTGGACTGGTGTGAATGTT 



370 
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282 
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329 

329 
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328 

326 

325 

326 

326 

326 

32S 
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326 

.325 

326 

326 

315 

325 

325 

324 

325 

323 

323 

324 

32.3 

324 

323 

323 

322 

323 

322 

322 

321 

321 

320 

321 

320 

319 

320 

317 

313 

316 

294 

277 

243 

165 
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380 



GCGGTTATGGTTGGTGG^ 



400 



420 



ATTCCCCRGGAAGGAGGGAATGGaAA GG^AGGATGTTaTnTr 



GCGGTTATGGTTGGTGGATTCCCCAGGAAGGAGGGAATGGAAAGGSlAGGATGTTATGTCTAA 
GCGGTTATGGTTGGTGGATTCCCCAGGAAGGAGGGAATGGAAAGGikGGATGTTATGTCTAA' 
GCGGTTATGGTTGGTGGATTCCCCAGGAAGGAGGGAATGGAAAGG^iAGGATGTTATGTCTAA 
GCGGTTATGGTTGGTGGATTCCCCAGGAAGGAGGGAATGGAAAGG^kGGATGTTATGTCTAA 
GCG[5TTATGGTTGGTGGATTCCCCAGGAAGGAGGGAATGGAAAGG^GGATGTTATGTCTAA 
GCGGTTATGGTTGGTGGATTCCCCAGGAAGGAGGGAATGGAAAGG^kGGATGTTATGTCTAA 
GCGGTTATGGTTGGTGGATTCCCCAGGAAGGAGGGAATGGAAAGG^iAGGATGTTATGTCTAA 
GCGgjTTATGGTTGGTGGATTCCCCAGGAAGGAGGGAATGGAAAGG^kGGATGTTATGTCTAA 
GCGGTTATGGTTGGTGGATTCCCCAGGAAGGAGGGAATGGAAAGGikGGATGTTATGTCTAA 
GCGGTTATGGTTGGTGGATTCCCCAGGAAGGAGGGAATGGAAAGG^GGATGTTATGTCTAA 

gcggttatggttggtggattccccaggaaggaggga.atggaaagg^!aggatgttatgtctaa 

GCGGTTATGGTTGGTGGATTCCCCAGGAAGGAGGGAATGGAAAGGikGGATGTTATGTCTAA 

°^°^™'^^°^t^°'^°g^ttccccaggaaggagggaatggaaagg;^ggatgttatgtctaa 

GCGGTTATGGTTGGTGGATTCCCCAGGAAGGAGGGAATGGAAAGG^kGGATGTTATGTC?^ 

|Gcggttatggttggtggattccccaggaaggagggaatggaaagg3aggatgttatgtctaa 
gcggttatggttggtggattccccaggaaggagggaatggaaaggsIaggatgttatgtctaa 

^^^°^r''''°^''''''^''°'''^'"^^''^^^'"^°<^A°GGAATGGAAAGGfi!AGGATGTTATGTCTAA 

gcggttatggttggtggattccccaggaaggagggaatggaaagg;^!aggatgttatgtctaa 



GCGGTTATGGTTGGTGGATTCCCCAGGAAGGAGGGAATGGAAAGGAlAGGATGTTATGTCTAA 
?rnn^^^r°^^''''^°^''^^'''^'=^^°°^°°^^^°^T°G^GGAkGGATGTTATSS^ 

°^°^^^^^°°^t°°^°g^ttccccaggaaggagggaatgga;.agga!aggatgttatgtctaa 

^??^™°^™°^°°^TTCCCCAGGAAGGAGGGA.ATGGAAAGG;^kGGATGTTATGTSK' 
GCGGTTATGGTTGGTGGATTCCCCAGGAAGGAGGGAATGGAAAGGAkcGATGTTATGTCTAA 
GCGGTTATGGTTGGTGGATTCCCCAGGAAGGAGGGAATGGAAAGGAkGGATGTTATGTCTAA 

gcggttatggttggtggattccccaggaaggagggaatggaaaggaIaggatgttaSS?^ 
gcggttatggttggtggattccccaggaaggagggaatggaaagga!aggatgttatgtctaa 
gcggttatggi^tggtggattccccaggaaggagggaatggaaaggaIaggatgttatgtctaa 
gcggttatggttggtggattccccaggaj^ggagggaatggaaaggaIaggatgttatgtctaa' 

GCGGTTATGGTTGGTGGATTCCCCAGGAAGGAGGGAATGGAAAGGAkGGATGTTSSSi^' 
GCGGTTATGGTTGGTGGATTCCCCAGGAAGGAGGGAATGGAAAGGAkGGATGTSS^c?^' 
GCGGTTATGGTTGGTGGATTCCCCAGGAAGGAGGGAATGGAAAGGAkGGATGTTATGTCTAA 

GCGGTTATGGTTGGTGGATTCCCCAGGAAGGAGGGAATGGAAAGGAkGGATGTTATGTCTAA 
GCGGTTATGGTTGGTGGATTCCCCAGGAAGGAGGGAATGGAAAGGAkGGATGTTATGTC?SJ 
GCGGTTATGGTTGGTGGATTCCCCAGGAAGGAGGGAATGGAAAGGAkGGATGTTATGTCTiJ 
GCGGTTATGGTTGGTGGATTCCCCAGGAAGGAGGGAATGGAAAGGAkGGATGTTATGxSJ^ 

^^°^™^°^^°^TG°ATTCCCCAGGAAGGAGGGAATGGAAAGGAkGGATGTTATGTCTA^ 
°^°™T<2°^^°°TGGATTCCCCAGGAAGGAGGGAATGGAAAGGAkGGATGTTATGTCTAA 
JS^^^^^°<^^^°Q^<^°^TTCCCCAGGAAGGAGGGAATGGAAAGGAAGGATGTTATGTC?^ 

gcggttatggttggtggatIccccaggaaggagggaatggaaaggaaggatgttatgtctaJ 
gcggttatggttggtggattccccaggaaggagggaatggaaaggaaggatgSatotSJ^ 
gcggttatggttggtggattccccaggaaggagggaatggaaaggaaggatgttatgtctaa 
gcggttatggttggtggattccccaggaaggagggaStggaaaggaaggatgttatgtcta^ 

B WGGTTGGTGGATTCCCCAGGAAGGAGGGAATGGAAAG GAAGGATGTTaTnTr-Ti^A 
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440 



GAATGTTTCAATC 



460 



480 



rCTACAAATCTCAAGCATCTGCCCTTGAAGCCCATGCAGCCrrnaaTTp" 



GAATGTTTCAATCTACAAATCTCAAGCATCTGCCCTTGAAGCCCATGCAGCCCCGAATTGCA 
GAATGTTTCAATCTACAAATCTCAAGCATCTGCCCTTGAAGCCCATGCAGCCCCGAATTGCA 

(3aatgtttcaatctacaaatctcaagcatctgcccttgaagcccatgcagccccSaattgcaI 
gaatgtttcaatctacaaatctcaagcatctgcccttgaagcccatgcagccccgaattgcaJ 
gaatgtttcaatctacaaatctcaagcatctgcccttgaagcccatgcagccccgaattgcaI 
gaatgtttcaatctacaaatctcaagcatctgcccttgaagcccatgcagccccgaattgca 
gaatgtttcaatctacaaatctcaagcatctgcccttgaagcccatgcagccccgaattgca 
gaatgtttcaatctacaaatctcaagcatctgcccttgaagcccatgcagccccgaattgca 
gaatgtttcaatctacaaatctcaagcatctgcccttgaagcccatgcagccccgaattgca 
gaatgtttcaatctacaaatctcaagcatctgcccttgaagcccatgcagccccgaattgca 
gaatgtttcaatctacaaatctcaagcatctgcccttgaagcccatgcagccccgaattgca 
gaatgtttcaatctacaaatctcaagcatctgcccttgaagcccatgcagccccgaattgca 
gaatgtttcaatctacaaatctcaagcatctgcccttgaagcccatgcagccccgaattgca 
gaatgtttcaatctacaaatctcaagcatctgcccttgaagcccatgcagccccgaattgca 
gaatgtttcaatctacaaatctcaagcatcIgcccttgaagcccatgcagccccgaattgca 
gaatgtttcaatctacaaatctcaagcatctgcccttgaagcccatgcagccccgaattgca 
gaatgtttcaatctacaaatctcaagcatc|gcccttgaagcccatgcagccccgaattgca 
gaatgtttcaatctacaaatctcaagcatctgcccttgaagcccatqcagccccgaattgca 



gaatgtttcaatctacaaatctcaagcatctgcccttgaagcccatgcagccccgaattgca 
gaatgtxtcaatctacaaatctcaagcatctgcccttgaagcccatgcagccccgaattgca 
gaatgtttcaatctacaaatctcaagcatctgcccttgaagcccatgcagccccgaattgcA 
gaatgtttcaatctacaaatctcaagcatctgcccttgaagcccatgcagccccgaattgca 
gaatgtttcaatctacaaatctcaagcatctgcccttgaagcccatgcagccccgaattgca 
gaatgtttcaatctacaaatctcaagcatctgcccttgaagcccatgcagccccgaattgca 
gaatgtttcaatctacaaatctcaagcatctgcccttgaagcccatgcagccccgaattgca 

MftATGTTTCAATCTACAAATCT|AAGCATCTGCCCTTGAAGCCCATGCABcCCCf«AATTGCA 

naatgtttcaatctacaaatctcaagcatctgcccttgaagcccatgcagccccgaattgca 

GAATGTTTCAATCTACAAATCTCAAGCATCTGCCCTTGAAGCCCATGCAGCCCCGAATTGCA 

gaatgtttcaatctacaaatctcaagcatctgcccttgaagcccatgcagccccgaattgca 

GAATGTTTCAATCTACAAATCTCAAGCATCTGCCCTTGAAGCCCATGCAGCCCCGAATTGCA 

gaatgtttcaatctacaaatctcaagcatctgcccttgaagcccatgcagccccgaattgca 

GAATGTTTCA ATCTACAAA TCTCAAGCATCTGCCCTTGflArJnr-a^,j.nzi..r-r-r.r.G7'_'\TTCC.". 



gaatgtttcaatctacaaatctcaagcatctgcccttgaagcccat6cagccccgaattgca 

GAATGTTTCAATCTACAAATCTCAAGCATCTGCCCTTGAAGCCCATGCAGCCCCGAATTGCA 

gaatgtttcaatctacaaatctcaagcatctgcccttgaagcccatgcagccccgaattgca 

GAATGTTTCAATCTACAAATCTCAAGCATCTGCCCTTGAAGCCCATGCAGCCCCGAATTGCA 

gaatgtttcaatctacaaatctcaagcatctgcccttgaagcccatgcagccccgaattgcaI 

GAATGTTTCAATCTACAAATCTCAAGCATCTGCCCTTGAAGCCCATGCAGCCCCGAATTGCA 
J^AATGTTTCAATCTACAAATCTCAAGCATCTGCCCTTGAAGCCCATGCAGCCCCGAATTGCA 
GAATGTTTCAATCTACAAATCTCAAG|atCTGCCCTTGAAGCCCAT(HCAGCCCCGAATTGCA 
GAATGTTTCAATCTACAAATCTCAAGCATCTGCCCTTGAAGCCCATGCAGCCCCGAATTGCA 
GAATGTTTCAATCTACAAATCTCAAGCATCTGCCCTTGAAGCCCATGCAGCCCCGAATTGCA 
GAATGTTTCAATCTACAAATCTCAAGCATCTGCCCTTGAAGCCCATGCAGCCCCGAATTG,iSA 

gaatgtttcaatctacaaatctcaagcatctgcccttgaagcccatgcagccccgaattgca 

GAATGTTTCAATCTACAAATCTCAAGCATCTGCCCTTGAAGCCCATGCAGCCCCGAATTGCA 
GAM^^^^^CAAAT^CAAG|^||tGCCCTTGAAGCCCAT(3CAGCCCCGAATTGCA 





CCCATGCAGCCCCGAATTGCA 
CCg|MjABcCCCGSfA|TGCA 




jmfGCAGCCCCGBATTGCA 
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500 
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60 



580 



600 



620 



tctattcctgagaagaacatcagttgtttgacccgcctagaccataAcagggcacttggtca 



TCTATTCCTGAGAAGAACATCAGTTGTTTGACCCGCCTAGACCATAACAGGGCACTTGGTCA 
TCTATTCCTGAGAAGAACATCAGTTGTTTGACCCGCCTAGACCATAACAGGGCACTTGGTCA 
TCTATTCCTGAgAAGAACATfAGTTGTTTGACCCGCCTAGACCATAACAGGGCACTfGGTCA 
TCTATTCCTGAGAAGAACATCAGTTGTTTGACCCGCCTAGACCATAACAGGGCACTTGGTCA 
TCTATTCCTGAGAAGAACATCAGTTGTTTGACCCGCCTAGACCATAACAGGGCACTTGGTCA 
TCTATTCCTGAGAAGAACATCAGTTGTTTGACCCGCCTAGACCATAACAGGGCACTTGGTCA 
TCTATTCCTGAGAAGAACATCAGTTGTTTGACCCGCCTAGACCATAACAGGGCACTTGGTCA 
TCTATTCCTGAGAAGAACATCAGTTGTTTGACCCGCCTAGACCATAACAGGGCACTTGGTCA 

tctattcctgagaagaacatcagttgtttgacccgcctagaccataacaggIEHBHBBB 

TCTATTCCTGAGAAGAACATCAGTTGTTTGACCCGCCTAGACCATAACAGGGCACT@GGTCA 

tctattcctgagaagaacatcagttgtttgacccgcctagaccataacagggcacttggtca 

TCTATTCCTGAGAAGAACATCAGTTGTTTGACCCGCCTAGACCATAACAGGGCACTgGGTCA 

tctattcctgagaagaacatcagttgtttgacccgcctagaccataacagggcacttggtca 

TCTATTCCTGAGAAGAACATCAGTTGTTTGACCCGCCTAGACCATAACAGGGCACTpGGTCA 

tctattcctgagaagaacatcagttgtttgacccgcctagaccataacagggcacttggtca 
tctattcctgagaagaacatcagttgtttgacccgcctagaccataacagggcact|ggtca 
tctattcctgagaagaacatcagttgtttgacccgcctagaccataacagggcacttggtca 

TCTATTCCTGAGAAGAACATCAGTTGTTTGACCCGCCTAGACCATAikcAGGGCACTTGGTCA 



TCTN;TTCCTGAGAA 

TCTATTCCTGAGAAGAACATCAGpGTTTGACCCGKCTAGACCATAACAGGGCACTfeGGTCA 
TCTATTCCTGAGAAGAACATCAGTTGTTTGACCCGCCTAGACCATAACAGGGCACTTGGTCA 
TCTATTCCTGAGAAGAACATCAGTTGTTTGACCCGCCTAGACCATAACAGGGCACTGGGTCA 
TCTATTCCTGAGAAGAACATCAGTTGTTTGACCCGCCTAGACCATAACAGGGCACTTGGTCA 
TCTATTCCTGAGAAGAACATCAGTTGTTTGACCCGCCTAGACCATAACAGGGCACTTGGTCA 

tctattcctgagaagaacatcagttgtttgacccgcctagaccataacagggcact|ggtca' 

TNTATTCCTGASAAGAACATgAWTTGTTTGACCCGCCTAGACCATAACAN^ 

tctat|cctgagaagaacatcagttgtttgacccgcctagaccataacagggcacttggtc^ 

TCTATTCCTGAGAAGAACATCAGTTGTTTGACCCGCCTAGACCATAACAGGGCACTTGGTPA 



tctattcctgagaagaacatcagttgtttgacccgcctagaccataAcagggcacttggtca 

TCTATTCCTGAGAAGAACATCAGTTGTTTGACCCGgCTAGACCATAkcAGGGCACTSGGScA 
TCTATTCCTGAGAAGAACAT CAGTTGTTTGACCCGCCTAGACCATAACAGGGCAnTTr;p;Trz\ 



TCTATTCCTGAGAAigAACATCAGgTGTTTGACCCGCCTAGACCATAACAGGgCACTgGGMCA 
TCTATTCCTGAGAAGAACATCAGTTGTTTGACCCGCCTAGACCATAkcAGGGCACTTGGTCA 
TCTATTCCTGAGAAGAACATCAGTTGTTTGACCCGCCTAGACCATAACAGGGCACTTGGTCA 
TCTATTCCTGAGAAGAACATCAGTTGTTTGACCCGCCTAGACCATAACAGGGCACTTGGTCA 
TCTATTCCTGAGAAGAACATCAGTTGTTTGACCCGCCTAGACCATAACAGGGCACTSgGTCA' 
TCTATTCCTGAGAAGAACATCAGTTGTTTGACCCGCCTAGACCATAACAGGGCACTgGGTCAj 
TCTATTCCTGASfAAGAACATgAgTTGTTTGACCCGCCTAiACCATAACAGGGCACTTGGTCA 

tctattcctgagaagaacatcagttgttxgacccgcctagaccataIacagggcacttggtca 
|TCtattcctgagaagaacatcagtt|tttgacccgcctagaccata!acagggcacttggtca 

TCTATTCCTGAGAAGAACATCAGTTGTTTGACCCGCCTAGACCATAkcAGGGCACTTGGTCA' 
TCTATTCCTGAGAAGAACATCAGTTGTTTGACCCGCCTAGACCATAkcAGGGCACTgGGTCA 
TCTATTCCTGAGAAGAACATCAGTTGTTTGACCCGCCTAGACCATAkcAGGGCACTTGGTCA 

tctattcctgagaagaacatcagttgtttgacccgcctagaccataIacagggcacttggtca 
tctattcctgagaagaacatcagttgtttgacccgcctagaccata!acagggcacttggt|a 
tctattcctgagaagaacatcagttgtttgacccgcctagaccataIacagggcactPggtca 
tctattcctgagaagaacatcagttgtttgacccgcctagaccataIacagggcacttggtca 

TCTATTCCTGAGAAGAAC ATCAGTTGTTTGACCCGCCTAGACCATAkcAGGGCArTTGGTra 



^^^^ 



640 



660 



680 
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LpMDHh27 
IipMDHh28 
LpMDHh29 
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LpMDHhS 4 
LpMDHhSS 
LpMDHhS 6 
LpMDHhS? 
LpMDHhSS 
LpMDHhS 9 
LpMDHh60 
LpMDHh61 
LpMDHh62 
LpMDHh63 
LpMDHh64 



GATCTCTGAGAGACTTGATGgjCCAAGTTAGTGATGTGAANAATGTTgiTCATCTGGGGCAATC 



GATCTCTGAGAGACTTGATGgCCAAGTTAGTGATGTGAAGAATGTTATCATCTGGGGCAATC 
GATCTCTGAGAGACTTGATGTCCAAGTTAGTGATGTGAAGAATGTTATCATCTGGGGCAATC 
GATCTCTGAGAGACTT§ATGTCCAA^TTAGTGATGTGAA^AATGTTATCATCTGGGGgAATC 
GATCTCTGAGAGACTTGATGTCCAAGTTAGTGATGTGAAGAATGTTATCATCTGGGGCAATC 
GATCTCTGAGAGACTTGATGTCCAAGTTAGTGATGTGAAGAATGTTATCATCTGGGGCAATC 
GATCTCTGAGAGACTTGATGTCCAAGTTAGTGATGTGAAGAATGTTkTCATCTGGGGCAATC 

gatctctgagagacttgatgtccaagttagtgatgtgaagaatgttatcatctggggcaatc 
Igatctctgagagacttgatgtccaagttagtgatgtgaagaatgttatcatctggggcaatc 



gatctctgagagacttgatgtccaagttagtgatgtgaagaatgttatcatctggggjgaatc 
gatctctgagagacttgatgtccaagttagtgatgtgaagaatgttatcatctggggcaatc 
gatctctgagaggcttgatgtccaagttagtgatgtgaagaatgttatcatctgggggaatc 
gatctctgagagacttgatgtccaagttagtgatgtgaagaatgttIatcatctggggcaatc 
gatctctgagagacttgatgtccaagttagtgatgtgaagaatgttatcatctgggggaatc 
gatctctgagagacttgatgtccaagttagtgatgtgaagaatgtt|atcatctggggcaatc 
gatctctgagaggcttgatgtccaagttagtgatgtgaagaatgtt|atcatctgggggaatc 
gatctctgagagacttgatgtccaagttagtgatgtgaagaatgttatcatctggggcaatc 
gatctctgagagacttgatgtccaagttagtgatgtgaagaatgtijatcatctggggcaatc 



GATCTCTGAGAGACTTGATGTgCAAGTTA>|gGATGTGAAWAATG^TATCATCTGGGG^^^^ 
GATCTCTGAGAGACTTGATGTCCAAGTTAGTGATGTGAA^AATGTTATCATCTGGGGCAATC 

gatctctgagagacttgatgtccaagttagtgatgtgaagaatgtllatcatctgggggaatc 
gatctctgagagacttgatgtccaagttagtgatgtgaagaatgttatcatctggggcaatc 
gatctctgagagacttgatgtccaagttagtgatgtgaagaatgttatcatctggggcaatc 

GATCTCTGAGAGG,CTTGATGTCCAA GTTAGTGATGTGAAGAATGTTATCATCTGGGGjr:AATC 
"r;" TCTgT§AS(AGACTTG§THTCSAg 

'gatctctgagagacttgatgtccaagttagsgatgnp 
gatctctgagagacttgatgtccaagttagtgatgtgaagaatgt'i'atcatctggggcaatc 



GATCTCTGAWAGACTTGATGTCCAAGTTA; 

gatctctgagagacttgatgtccaagttagtgatg 
gatctctgagagacttgatgtccaagttagtgatgtgaagaatgttatcatctggggcaatc 



GAi^CTCTGAGAGACNTGATG^CCAAGMTgGNGgTGgl 
GATCTCTGAGAGACTTGATGTCCAAGTTAGTGATGTGAAGAATGTTATCATCTGGG'ggAATC 
GATCTCTGAGAGACTTGATGTCCAAGTTAGTGATGTGAAGAATGTTATCATCTGGGGCAATC 

gatctctgagagacttgatgtccaagttagtgatgtgaagaatgttatcatctggggcaatc 
gatctctgagagacttgatgtccaagttagtgatgtgaagaatgttatcatctgggg^aatc 
gatctctgagagacttgatgtccaagttagtgatgtgaagaatgttatcatctggggglaatc 
gatct^tgagagacttgatggccaagttagngatgtgaa^aatgttatcatmtggggcaa^ 
gatctctgagagacttgatgtccaagttagtgatgtgaagaatgttatcatctggggcaatc 
gatctctgagagacttgatgtccaagttagtgatgtgaagaatgttatcatctggggcaatc 
gatctctgagagacttgatgtccaagttagtgatgtgaagaatgttatcatctggggcaatc 
gatctctgagagacttgatgtccaagttagtgatgtgaagaatgttatcatctggggjgaatc 
gatctctgagagacttgatgtccaagttagtgatgtgaagaatgttatcatctggggcaatc 
gatctctgagagacttgatgtIcaagttagtgatgtgaanaatgStatcatctggnS^g 
gatctctgagagacttgatgtccaagttagtgatgtgaagaatgttatcatctggggcaatc 
|gatctctgagagacttgatgtccaagttagtgatgtgaagaatgttatcatctgggg§aatc 
Igatctctgagagacttgatgtccaagttagtgatgtgaagaatgttatcatctggggcaatc 
gatct ctgagagacttgatgtccaagttagtgatgtg aagaatg tlratgatctggggcaatc 

Igcaatc 



680 



640 
638 
636 
639 
639 
637 
638 
636 

636 
636 
6*36 
63S 
636 
636 
63S 
636 
636 



63S 
634 
63S 
633 
633 
634 
595 
634 
633 

599 
606 
631 



60S 
630 
631 
630 
629 
630 
627 
623 
626 
604 
587 
553 
475 
304 
226 
206 
202 
6 



LpMDHh2 
LpMDHh3 
LpMDHh4 
LpMDHhS 
LpMDHhS 
LpMDHh? 
LpMDHha 
LpMDHhS 
LpMDHhlO 
LpMDHhll 
LpMDHhl2 
IipMDHhl3 
LpMDHhl4 
LpMDHhlS 
LpMDHhie 
LpMDHhl? 
XipMDHhlS 
LpMDHhlS 
LpMDHh2 0 
LpMDHh2 1 
LpMDHh22 
LpMDHh23 
LpMDHh24 
LpMDHh25 
LpMDHh26 
LpMDHh27 
LpMDHh28 
LpMDHh2S 
LpMDHh30 
LpMDHh31 
LpMDHh32 
LpMDHh34 
LpMDHh35 
LpMDHh36 
LpMDHh37 
LpMDHh'38 
LpMDHh3 9 
LpMDHh40 
LpMDHh41 
LpMDHh42 
LpMDHh43 
LpMDHh44 
LpMDHh45 
LpMDHh46 
LpMDHh47 
LpMDHh48 
,LpMDHh49 
. LpMDHhS 0 
LpMDHhS 1 
LpMDHhS 2 
LpMDHhS 3 
LpMDHh54 
LpMDHhS 5 
LpMDHhS 6 
LpMDHhS? 
LpMDHhS 8 
LpMDHhS 9 
LpMDHh60 
LpMDHh61 
LpMDHh62 
LpMDHhS 3 
LpMDHh64 



CAGTCAG 



700 * 720 



740 



GAAGACiTTCCAGTGiSCGAGAAG 



SBBRiS 



jACTCTTCCAG 

ACTCTTCCAGTCAGTACCCTGfl 

j3^:!^^^^<^^C^T^^CCTGAT£5TGAACCACiccgCCMgAAi;kqTTCC^^ 
J^!^J^^^^°^^^°TACCCTGATGTGAACCACGCCACc5TCAASAclrTCCAG?GGCGA~ 
ACTCTTCCAGTCAGTACCCTGATGTGAACCACGCCACCGTGAAGAC^TTCCAGiGGCGAGAiG 

actcttccagtcagtaccctgatgtgaaccacgccaccgtgaaIagt^^M 

f^^iE^^^'^^^^'^C^GTACCCTGATGTGAACCACGCCACCGTGAAGAdTlS^CGAGAAG 

^ctcttccagtcagtac^^ 



^^^^^^'^''^^^CAgiTACCCTGATGTGAACCACGCCACCGTGAAGAGTTCCAGTGGH 

^^^^Z^'''''''''^^^;S™'=^<=^g^tgtgaaccacgccaccgtgaagaq'ttccagtgg-6gagaag 

^^^^^^^'^''''^^^■T^C^^TGATGTGAACCACGCCACCGTGAAGAdTTCCAGTGGCGJGJjG 
J^^^^^^^^^'^^^^TACCCTGATGTGAACCACGCCACCGTGAAGAdTTCCAGTGGCGlGJJG 
ACTCTTCCAGTCAiTACCCTGATGTGAACCACGCCACCGTGAAGAdTTCCACTGSGSGjjG 
ACTCTTMCAGTCAGTACCCTGATGTGAACCACGCCACCGTGAAGAdTTCCAGTGGCGAGJiG 
ACTCTTCCAGTCASTACCCTGATGTGAACCACGCCACCGTGAAGAdTTCCAGTGGCGAG^jG 

actcttccagtcagtaccctgatgtgaaccacgccaccgtgaagaqttccagtggcgJgJ^g 

ACTCTTCCAGTCAGTACCCTGATGTGAACCACGCCA CCGTGAA GAOTTMCAGTGGCGA^^g 



ACTCTTgCAGWC 

ACTCTTCCAGTC^ 

^^!^!:;!fe^^^°^^^5'TACCCTGATGTGAACCACGCCACCGTGAAijAGTT iCAGTGGCGAGAAG 
ACTCTTCCAGTCAGTACCCTGATGTGAACCACGCCACCGTGAA&AdTTHcAGTGGCGAGAAG 
^^^^r'=^^^°T^^§™CCCTGATGTGAACCACGCCACCGTGAAGAdTTCCAGTGGcSG^G' 
ACTCTTCCAGTCAlTACCCTGATGTGAACCACGCCACCGTGAAGAdTTCCAGTGGSS^G 



^n^^r^'''''''''^''^™'^'^'^^°^^^^°^^'^^^°^^^^^^gGAAG AGTTCCAGTGiSCGAGAlJ 
ACTCTTCCAGTCAGTACCCTGATGTGAACCACGCCACC^ 



CTCTTCCAGTCAGTACCCTGATGTGAACCACGCCACCGTGAJ 



AGAGTTWCAGTGG 



ACTCTTCCAGTCAGTACCCTGATGTGAACCACGCCACCGTGAAGAciTTCCAGTGG 
J^^^^^^^^^'^'^^^^TAC^C'^G^^GTGAACCACGCCACCGTGAAGAdTJcSSGGCGAGAAG 
ACTCTTCCAGTCAGTACCCTGATGTGAACCACGCCACCGTGAAGAdTTCCAGTGGCGAGAAG 
ACTCTTCCAGTCAiTACCCTGATGTGAACCACGCCACCGTGAAGAdTTsSGTCGCGSH 
ACTCTTCCAGTCAdTACCCTGATGTGAACCACGCCACCGTGAAGAdTTicACTScGSi 
ACTNTTCCAGTCAGTicCCTGATGTGAACCACicc.gcCGPAAlAdTTdcAG^^SHH 
ACTCTTCCAGTCAGTACCCTGATGTGAACCACGCCACCGTdAAdAdTTCCAGS^^H 
CTCTTCCAGTCAGTACCCTGATGTGAACCACGCCACCGTGAAGAGTTCCAGT^^M^ 
ACTCTTCCAGTCAGTACCCTGATGTGAACCACGCCACCGTGAGGAdTTCCACTGGCGAGjjG 
.^^^r^^^^-f^^TACCCTGATGTGAACCACGGCACCGTGAKGAdTTcSGTGGCGAGSSG 

actcttccagtcagtaccctgatgtgaaccacgccaccgtgaWttcS^^ 

ACTCTTJJpAlgCKiT§CCCTGATGiWiccFfCGCC.^CGBB^^^^^B^SfflHHH 

^S^^^^^^^TCAiTACCCTGATGTGAACCACGCCACCGTGAAGAGTTCCAGTGGCGjGijG 
f^IJ^^^^^QT^^QTACCCTGATGTGAACCACGCCACCGTGAAGAGTTcSGTGGSAGjJG 

ACTCTTCCAGTCAGTACCCTGBHGTGAACCACGCCACCGTGAAGAd:TTCCAGTGGCGAGAAG 
-^^- TACCCTGATGTGAACCACGCCACCGTGAAG AdrTTrrar^-rnr^r^riBnn 7^ o 

TgsCMGTG GCiAGBAG 

gcg'agaag 



742 



650 
660 
693 
701 
701 
684 
700 
698 

692 
698 
698 
697 
698 
698 
697 
698 
698 



696 
646 
697 
695 
695 
696 

696 
6.95 



686 



685 
693 
692 
690 
692 
679 
676 
688 
666 
649 
615 
513 
366 
288 
268 
264 
67 
49 
14 
8 



^ij^Hhl 

IipMDHh2 
LpMDHh3 
LpMDHh4 
LpMDHhS 
LpMDHh6 
LpMDHh? 
LpMDHhS 
LpMDHhS 
LpMDHhlO 
LpMDHhll 
IjpMDHhl2 
LpMDHhlB 
LpMDHhl4 
IipMDHhlS 
LpMDHhie 
lipMDHhl? 
IipMDHhie 
LpMDHhl9 
LpMDHh20 
' IipMDHh21 
IjpMDHh22 
' LpMDHh23 
I*pMDHh24 
IjpMDHh25 
LpMDHh26 
LpMDHh27 
LpMDHh28 
LpMDHh29 
IjpMDHh30 
IipMDHh31 
IipMDHh32 
LpMDHh34 
LpMDHh35 
X«pMDHh36 
IipMDHh37 
LpMDHh38 
LpMDHhS 9 
LpMDHh40 
LpMDHh41 
LpMDHh42 
IipMDHh43 
IipMDHh44 
LpMDHh45 
LpMDHh46 
LpMDHh47 
Lp^[DH]l48 
IipMDHh49 
LpMDHhS 0 
LpMDHhS 1 
LpMDHhS 2 
LpMDHhS 3 
LpMDHhS 4 
LpMDHhS 5 
LpMDHhS 6 
LpMDHhS? 
LpMDHhS 8 
LpMDHhS 9 
LpMDHhS 0 
LpMDHhei 
LpMDHh62 
LpMDHh63 
LpMDHh64 



760 



CCTGTTCGCGAACTTGTTAAAGACGATG 



780 



800 



CCTGTTCGCGAACTTGTTAAAGACGATGAATGGCTAAATGCAGGGNlTCATTGCCACTGTCCA 
CCTGTTCGCGAACTTGTTAAAGACGATGAATGGCTAAATGCAgGGirCATTGCCACTGTCCAi 



CCTGTTC 



CCTGTTCGCGAACTTGTTAAAGACGATGAATGGCTAAATGCAGGGTtrCATTGCCACTGTCCA 



CCTGTTCGCGAACTTGTTAAAGACGATGAATGGCTAAATGCAGGGTjTCATTGCCACTGTCCA 
CCTGTTCGCGAACTTGTTAAAGACGATGAATGGCTAAATGCAGGGTTCATTGCCACTGTCCA 
CCTGTTCGCGAACTTGTTAAAGACGATGAATGGCTAAATGCAGGGTTCATTGCCACTGTCCA 
CCTGTTCGCGAACTTGTTAAAGACGATGAATGGCTAAATGCAGGGTTCATTGCCACTGTCCA 
CCTGTTCGCGAACTTGTTAAAGACGATGAATGGCTAAATGCAGGGOTCATTGCCACTGTCCA 
CCTGTTCGCGAACTTGTTAAAGACGATGAATGGCTAAATGCAGGGTyrCATTGCCACTGTCCA 

cctgt-tcgcgaacttgttaaagacgatgaatggctIaatgcagggttcattgccactgEicca 
cctgttcgcgaacttgttaaagacgatgaatgg ctaaatgcagggt!tcattgccactgtcca 



CCTGTTCGCGAACT] 

CCTGTTCGCGAACTTGTTAAAGACGATGAATGGCTAAATGCAGGGTjTCATTGCCACTGTCCA 
CCTGTTCGCGAACTTGTTAAAGACGATGAATGGCTAAATGCAGGGTfTCATTGCCACTGTCCA 

cctgttcgcgaacttgItaaagacgatgaatggctaaatgcagggtItcattgccactgtcca 



:CTGTTCGCGAACTTGTTAAAGACGATGAATGGCTAAATGCAGGGTirCATTGCCACTGTCCA 



CCTGTTCGCGAACTTGTTAAAGACGATGAATGGCTAAATGCAGGGTfrCATTGCCACTGTCCA 
CCTGTTCGCGAACTTGTTAAAGACGATGAATGGCTAAATGCAGGGTJTCATTGCCACTGTCCA 



CCTGTTCG 



CCTGTTCGCGAACTTGTTAAAGACGATGAATGGCTAAATGCAGGGTfTCATTGCCACTGTCCA 
CCTGTTCGCGAACTTGTTAAAGACGAT GAATGGCT7\AATGCA GGGT[rCATTGCCACTGTCCA 
CCTGTTCGCGAACTTGTTAAAGACGATgHHSJBHjgjBJJ^Jpgl 

CCTGTTCGCGAACTTGTTAAAGACGATGAATGGCTAAATGCAGGGTlTCATTGCCACTGTCCA 



CCTGTTCGCGAACTTGTTAAAGACGATGAATGGCTAAATGCAGGGTlrCATTGCCACTGTCCA 
|CCTGTTCGCGAACTTGTTAAAGACGATGAATGGCTAAATGCAGGGTD?CATTGCCACTGTCCA 
CCTGTTCGCGAACTTGTTAAAGACGATGAATGGCTAAATGCAGGGTTCATTGCCACTGTCCA 
CCTGTTCGCGAACTTGTTAAAGACGATGAATGGCTAAATGCAGGGTrrCATTGCCACTGTCCA 
CCTGTTCGCGAACTTGTTAAAGACGATGAATGGCTAAATGCAGGGTtcATTGCCACTGTCCA 
CCTGTTCGCGAACTTGTTAAAGACGATGAATGGCTAAATGCAGGGTTCATTGCCACTGTCCA 

cctgttcgcgHacttgttaaagacgatgaatggctaaatgcagggtjtcattgccactgtcca 

C CTGTTCGC GAACTTGTTAAAGACGATGAATGGCTAAATGrAC:;nr;^frz\T-Tr:r-r-7xnTr-~TCC_^. 



820 



840 



860 



,pI»©Hhl 
LpMDHh2 
LpMDHh3 
LpMDHh4 
LpMDHhS 
LpMlDHhS 
lipMDHh? 
IjpMDHhS 
IjpMDHh9 
LpMDHhlO 
LpMDHhll 
IipMDHhl2 
IjpMDHhl3 
LpMDHhl4 
LpMDHhlS 
I,pMDHhl6 
LpMDHhl? 
LpiyiDHhlS 
LpMDHhlS 
IipMDHh20 
IjpMDHh21 
LpMDHh22 
I,pMDHh23 
LpMDHh24 
X.pMDHh25 
LpMDHh26 
IipMDHh27 
IipMDHh28 
IipMDHh29 
IipMDHh30 
IipMDHh31 
IipMDHh32 
IipMDHh34 
LpMDHhS 5 
LpMDHhS 6 
LpMDHh37 
LpMDHhS 8 
LpMDHhS 9 
LpMDHh40 
LpMDHh41 
LpMDHh42 
LpMDHh43 
LpMDHh44 
LpMDHh45 
LpMDHh46 
LpMDHh47 
LpMDHh48 
LpMDHh49 
LpMDHhS 0 
LpMDHhSl 
LpMDHh52 
LpMDHhS 3 
LpMDHhS 4 
LpMDHhS 5 
LpMDHhS 6 
LpMDHhS 7 
LpMDHhS 8 
LpMDHhS 9 
LpMDHh60 
LpMDHh61 
LpMDHh62 
LpMDHh63 
LpMDHh64 



gcagcgtggtgctgcaatcatcaaagcgag 
gcagcgtggSIgctgcaatcatcaaag^^gaggaa^ 



GCAGCGTGGTGCTGCAATCATCAAAGCGAGGAAGCTCTgCA 




GCAGCGTGGTG 

GCAGCGTGGTGCTGC7\ATCATCAAAG 
GCAGCGTGGTGCTGCAATCATCAAAGCGAGGAAGCTCTNCAGTGC 



GCAGCGTGGTGCTGCAATCATA 



GCAGCGTG 

GCAGCGTGGTGCTGCAATCATCAAAGCGAGGAAGC.T 



GCAGCGTGGTGCTGCAATCATCAAAGCGAGGAAGCT 
GCAGCGTGGTGCTGC?^TCATCAAAGCGAGGAAGCTCTCCAGTG 



GCAGCGTGGTGCTGCAATCATCAAAGCGAGGAAGCTCTCCAGTGCTjCTCTCTGCTGCCAGCI 



GCAGCGTGGTGCTGCAATCATCAAAGCGAGGAAGCTgjTCCAGTGCTCTlgTlgTGCTGCCAGCT 
GCAGCGTGGTGCTGCAATCATCAAAGCGAGGAAGCTCTCCAGTGCTCTCTCTGCTGCCAGCT 
GCAGCGTGGTGCTGCAATCATCAAAGCGAGGAAGCTCTCCAGTGCTCTCTCTGCTGCCAGCT 
GCAGCGTGGTGCTGCAATCATCAAAGCGAGGAAGCTCTCCAGTGCTCTCTCTGCTGCCAGCT 
GCAGCGTGGTGCTGCAATCATCAAAGCGAGGAAGCTCTCCAGTGCTCTCTCTGCTGCCAGCT 

gcagcgtggtgctgcaatcatcaaagcgaggaagctctccagtgctctctctgctgccagct 
[gcagcgnggtgctgcaatcatcaaagcgaggaagctctccagtgctctctctgctgccagct 

hrAGCGTGnTnrTnrAATC:ATCAAAGCGAGGAAGCTCTCCAGTGC'I?CTCT CTGCTGCCAGCT 



793 
797 



801 



764 
769 
770 
779 
788 
794 
797 
802 



768 
783 
803 



779 



763 
790 



786 

772 

739 

490 
412 
392 
388 
191 
173 
137 
132 



880 



900 



G&KDHhl 
LpMDHli2 
LpMDHh3 
LpMDHh4 
LpMDHhS 
I«pMDHh6 
LpMDHh7 
LpMDHhS 
LpMDHh9 
LpMDHhlO 
LpMDHhll 
IipMDHhl2 
LpMDHhl3 
LpMDHhl4 
LpMDHhlS 
LpMDHhie 
LpMDHhl? 
LpMDHhia 
LpMDHhl 9 
LpMDHh20 
LpMDHh2a 
LpMDHh22 
LpMDHh23 
LpMDHh24 
LpMDHh25 
LpMDHh26 
LpMDHh27 
LpMDHh28 
LpMDHh29 
LpMDHh30 
LpMDHh3X 
LpMDHh32 
LpMDHh34 
LpMDHh35 
LpMDHhS 6 
LpMDHh37 
LpMDHh38 
LpMDHh39 
LpMDHh40 
LpMDHh4X 
LpMDHh42 
LpMDHh43 
LpMDHh44 
LpMDHh45 
LpMDHh46 
LpMDHh47 
LpMDHh48 
LpMDHh49 
LpMDHhS 0 
LpMDHhS 1 
LpMDHhS 2 
LpMDHhS 3 
LpMDHhS 4 
LpMDHhS 5 
LpMDHhS 6 
LpMDHhS 7 
LpMDHhS 8 
LpMDHhS 9 
LpMDHhS 0 
LpMDHh61 
LpMDHh62 
LpMDHhS 3 
LpMDHh64 



920 



CTGCTTGTGACCACATCCGTGATl 



CTGCTTGTGACCACATCCGgGATTGGGTTCTCGGAACCCCTGAi^GGAACATTTGTTTCCATG 
CTGCTTGTGACCACATCCGTGATTGGGTTCTfqGGAACCCCTGAGGGAACATTTGTTTCCATG 
|CTGCTTGTGACCACATCCGTGATTGGGTTCTCGGAACCCCTGAGGGAACATTTGTTTCCATG 
CTGCTTGTGACCACATCCGTGATTGGGTTCTCGGAACCCCTGAGGGAACATTTGTTTCCATG 
CTGCTTGTGACCACATCCGTGATTGGGTTCTCGGAACCCCTGAGGGAACATTTGTTTCCATG 
CTGCTTGTGACCACATCCGTGATTGGGTTCTCGGAACCCCTGAGGGAACATTTGTTTCCATG 
CTGCTTGTGACCACATCCGTGATTGGGTTCTCGGAACCCCTGAGGGAACATTTGTTTCCATGI 
CTGCTTGTGA CCACATCCGTGATTG GGTTCTCGGAArprrTf^anr:r!:a2^rz^TTTnTT^TrT-7^T^ 



763 

552 
474 
454 
450 
253 
235 
199 
194 



940 



LpMDHh2 
LpMDHh3 
ripMDHh4 
LpHDHhS 
LpMDHh6 
LpMDHh? 
LpMDHhS 
LpMDHhS 
LpMDHhlO 
LpMDHhll 
LpMDHhl2 
LpMDHhlS 
IipMDHhl4 
LpMDHhlB 
IipMDHhie 
IipMDHhl? 
LpMDHhlB 
ljpMDHhl9 
LpMDHhS 0 
LpMDHh21 
I LpMDHh22 
' LpMDHh23 
LpMDHh24 
LpMDHh25 
LpMDHh26 
LpMDHh27 
LpMDHh28 
LpMDHh29 
LpMDHhS 0 
LpMDHh31 
LpMDHh32 
LpMDHh34 
LpMDHh35 
LpMDHhS 6 
LpMDHh37 
LpMDHh38 
LpMDHhS 9 
LpMDHh40 
LpMDHh41 
LpMDHh42 
LpMDHh43 



960 



LpMDHh44 
'LpMDHh45 
LpMDHh46 
LpMDHh47 
LpMDHh48 
LpMDHh49 
: LpMDHhS 0 
LpMDHhS 1 
LpMDHh52 
LpMDHhSS 
LpMDHh54 
LpMDHhSS 
LpMDHhS 6 
LpMDHhS? 
LpMDHhS 8 
LpMDHhS9 
LpMDHheo 
LpMDHhei 
LpMDHh62 
LpMDHhSS 
LpMDHh64 



980 



GGTGTGTATTCTGATGGTTCATACGGTGTGCCTcHTGGGCTTATCTAtTCCTTCCCAGlAAC 
G.gTGTGTATTCTGATGGTTCATACGGTGTGCCTGCTGGGCTTATCTAtTcS?ccS^ 
GGTGTGTATTCTGATGGTTCATACGGTGTGCCTGCTGGGCTTATCTAtTCCTTCCCAGTiSc 
GGTGTGTATTCTGATGGTTCATACGGTGTGCCTGCTGGGCTTATCTAtxcSJcSiSSSc 
GGTGTGTATTCTGATGGTTCATACGGTGTGCCTGCTGGGCTTATCTAtTCCTTCCCAGTAic 
GGTGTGTATTCTGATGGTTCATACGGTGTGCCTGCTGGGCTTATCTAtTCCTTCCCAGTAAC 
GGTGTGTATTCTGATGGTTCATACGGTGTGCCTGCTGGGCTTATCTAtTCCTTCCCAGTAAC 



613 
536 
516 
512 
315 
297 
261 
256 




1000 



LpMDHhl 
LpMDHh2 
LpMDHhS 
IipMDHh4 
LpMDHhS 
LpMDHh6 
LpMDHh? 
LpMDHhS 
LpMDHh9 
LpMDHhlO 
LpMDHhll 
LpMDHhl2 
LpMDHhl3 
IipMDHhl4 
LpMDHhlS 

LpMDHhl? 
lipMDHhlS 
LpMDHh.19 
IipMDHh20 
I>pMDHh21 
I LpMDHh22 
LpMDHh23 
LpMDHh24 
IipMDHh25 
LpMDHh26 
LpMDHh27 
LpMDHh28 
LpMDHh29 
LpMDHha 0 
IipMDHh31 
LpMDHh32 
IipZ^Hh34 
IipMDHh35 
LpMDHh36 
LpMDHh37 
LpMDHh38 
LpMDHh39 
XipMDHh40 
IipMDHh4X 
LpMDHh42 
J IipMDHh43 
LpMDHh44 
IipMDHh45 
IipMDHh46 
LpMDHh47 
LpMDHh48 
LpMDHh49 
IipMDHhSO 
LpMDHhS 1 
IipMDHh52 
LpMDHhS 3 
LpMDHhS 4 
LpMDHhS 5 
LpMDHhS 6 
LpMDHhS? 
LpMDHhS 8 
LpMDHhS 9 
LpMDHhS 0 
LpMDHhS 1 
LpMDHh62 
LpMDHhS 3 
LpMDHhS4 



1020 



1040 




664 
598 
S78 
574 
377 
359 
323 
318 
28 



LpMDHh2 
LpMDHh3 
IipMDHh4 
LpMDHhS 
LpMDHhG 
LpMDHh? 
LpMDHhS 
LpMDHhS 
LpMDHhlO 
LpMDHhll 
•LpMDHhl2 
LpMDHhl3 
LpMDHhl4 
LpMDHhlS 
LpMDHhie 
LpMDHhl7 
LpMDHhlS 
LpMDHhl9 
LpMDHh20 
LpMDHh21 
;LpMDHh22 
'LpMDHh23 
LpMDHh24 
LpMDHh25 ' 
LpMDHh2 6 
LpMDHh27 
LpMDHh28 
LpMDHh29 
LpMDHhS 0 
LpMDHhai 
LpMDHhS 2 
LpMDHhS 4 
LpMDHhS 5 
LpMDHhS 6 
LpMDHhS? 
LpMDHhS 8 
LpMDHhS 9 
LpMDHh40 
LpMDHh41 
LpMDHh42 



1060 



1080 



LpMDHh4S 



'LpMDHh44 
LpMDHh45 
LpMDHh46 
LpMDHh47 
LpMDHh48 
LpMDHh49 
LpMDHhS 0 
LpMDHhS 1 
LpMDHhS2 
LpMDHhS S 
LpMDHh54 
LpMDHhSS 
LpMDHhS 6 
•LpMDHhS? 
LpMDHhSS . 
LpMDHhS 9 
LpMDHh60 
LpMDHh61 
LpMDHh62 
LpMDHheS 
LpMDHh64 



1100 



AGATGGATGCCACAGCCCAGGAGCTCTCGNAGGAGAAGGCTCTCGSfASS^ 

agatggatgccacagcccaggagctctcggaggagaaggctctcgSESSgS 

AGATGGATGCCACAGCCCAGGAGCTCTCGGAGGAGAAGGCTcJcScSSS^^ 



.x^^xv-^UM^u/^UAAGGCTCTCGCCpACTCGTGCCTCGAG 

SCCpACTCGgGCCTCGAG 
gCTCGGAGGAGAAGGCTCTC GCCITACTnGTnrrTrnZir; 



660 
640 
636 
439 
421 
S8S 
380 
89 
38 




1120 



LpMDHhl 
LpMbHh2 
LpMDHhB 
LpMDHh4 
LpMDHhS 
LpMDHh6 
LpMDHh? 
LpMDHhe 
XipMDHhd 
LpMDHhlO 
LpMDHhll 
LpMDHhl2 
LpMBHhia 
IipMDHhl4 
LpMDHhlS 
LpMDHhl 6 
LpMDHhl? 
LpMDHhl 8 
LpMDHhl9 
LpMDHh20 
LpMDHh21 
)LpMDHh22 
LpMDHh23 
Lp.MDHh24 
LpMDHh25 
LpMDHh26 
LpMDHh27 
LpMDHh28 
LpMDHh2 9 
LpMDHhaO 
LpMDHh31 
LpMDHhS 2 
LpMDHh34 
LpMDHhS 5 
LpMDHhS 6 
• LpMDHhS? 
LpMDHhS 8 
LpMDHhS 9 
LpMDHh40 
LpMDHh41 
LpMDHh42 



LpMDHh43 



'LpMDHh44 
LpMDHh45 
LpMDHh46 
LpMDHh47 
LpMDHh48 
LpMDHh49 
LpMDHhS 0 
LpMDHhS 1 
LpMDHhS 2 
LpMDHhS 3 
LpMDHh54 
LpMDHhSS 
LpMDHhS 6 
LpMDHhS? 
LpMDHhS 8 
LpMDHhS 9 
LpMDHh60 
LpMDHhe 1 
LpMDHh62 
LpMDHh63 
LpMDHh64 



1140 



1160 



11 



TAACTGCATACCAGGGAGCAGCTGCCGCTCTGATGTTTTGAATAAAJ^GGAACATTTTGGCTW 
TAACTGCATACCAGGGAGCAGCTG|CGCTCTGATGTTTTGAATAAAJHGWACATTTTGlfc 
TAACTG CATACGAGGG AG CAGCTGC CGCTGtBBBHSBBXEBBS^B^BEBBBH' 

taactgcataccagggagcagctgccgctctgatgttttgaaSaaSg^ 

taactgcataccagggagcagctgccgctctgatgttttgaataaa^Iggaacattttggctc 

taactgcataccagggagcagctgccgctctgatgttttgaataaa^Jggaacattttggctci 

TAACTGCATACCAGGGAGCAGCTGCCGCTCTGATGTTTTGAATAAAfllGGAACATTTTGGCTC 

taactgcataccagggagcagctgccgctctgatgttttgaataaa^Iggaacattttggctc 

TAACTGCATACCAGGGAGCAGCTGCCGCTCTG ATGTTTTGAATAAAj^GGAACATTTTGGr-Tr 



?22 
701 
€6? 
SOI 
483 
44? 
442 
151 
100 




• 80 



1200 



1220 



1240 



LpMDHhl 
LpMDHh2 
LpMDHh3 
LpMDHh4 
LpMDHhS 
LpMDHh6 
LpMDHh? 
LpMDHhe 
LpMDHhS 
LpMDHhlO 
LpMDHhll 
LpMDHhl2 
LpMpHhia 
LpMDHhl4 
LpMDHhlS 
lipMDHhie 
LpMDHhl? 
LpMDHhlB 
LpMDHhl 9 
LpMDHh20 
LpMDHh21 
^LpMDHh22 
' LpMDHh23 
LpMDHh24 
LpMDHh25 
LpMDHh26 
LpMDHh27 
LpMDHh28 
LpMDHh29 
LpMDHh30 
LpMDHh31 
LpMDHh32 
■ LpMDHh34 
LpMDHh35 
LpMDHh36 
LpMDHh37 
LpMDHh38 ' 
LpMDHh39 
LpMDHh40 
LpMDHh41 
LpMDHh42 
LpMDHh43 
LpMDHh44 
LpMDHh45 
LpMDHh46 
LpMDHh47 
IipMDHh48 
IjpMDHh49 
LpMDHhBO 
LpMDHhSl 
LpMDHhS 2 
LpMDHh53 
LpMDHh54 
LpMDHhSS 
LpMDHh56 
LpMDHhS? 
LpMDHhBS 
LpMDHhS 9 
LpMDHheO 
I»pMDHh61 
LpMDHh62 
LpMDHh63 
LpMDHh64 



} 



CATGAAACTCATCTCCACTCAGAACAGTTGCACATCGCGGTGCCTTTAGCTGGTTTTTCCAG 
CATGAAACTCATCTCCACTCAGAACAGTTGCACATCGCGGTGCCTTTAGCTGGTTTTTCCAG 
CATGAAACTCATCTCCACTCAGAACAGTTGCACATCGCGGTGCCTTpAGCTGGTTTTTCCAG 
CATGAAACTCATCTCCACTCAGAACAGTTGCACATCGCGGTGCCTTTAGCTGGTTTTTCCAG 
CATGAAACTCATCTCCACTCAGAACAGTTGCACATCGCGGTGCCTTIjAGCTGGTTTTTCCAG 
CATGAAACTCATCTCCACTCAGAACAGTTGCACATCGCGGTGCCTTTIAGCTGGT^TTTCCAG 



734 
705 

563 
545 
509 
504 
213 
162 




1260 



1280 



1300 



IipMDHhl 
LpMDHb2 
LpMDHha 
LpMDHh4 
LpMDHhS 
LpMDHhS 
LpMDHh? • 
LpMDHhS 
LpMDHhS 
LpMDHhlO 
LpMDHhll, 
. LpMDHhl2' 
LpMDHhlB 
LpMDiE]hl4 
LpMDHhlS 
LpMDHhie. 
LpMDHhl? 
LpMDHhlS 
LpMDHhl 9 
LpMDHh20 
LpMDHh21 
J LpMDHh22 
LpMDHh23 
LpMDHh24 
LpMDHh25 
LpMDHh26 
.LpMDHh27 
XipMDHh28 
LpMDHh29 
LpMDHhS 0 
IipMDHh31 
LpMDHhS 2 
LpMDHhS 4 
LpMDHhS 5 
LpMDHhS 6 
. I^pMDHhS? 
LpMDHhS S 
LpMDHhS 9 
LpMDHh40 
LpMDHh41 
LpMDHh42 
LpMDHh43 
'LpMDHh44 
LpMDHh45. 
LpMDHh46 
LpMDHh47 
LpMDHh4S 
LpMDHh49 
LpMDHhS 0 
LpMDHhS 1 
LpMDHhS 2 
LpMDHhS 3 
LpMDHh54 
LpMDHhSS 
LpMDHhS 6 
LpMDHhS? 
LpMDHhSS 
LpMDHhS 9 
LpMDHheO 
LpMDHhei 
LpMDHh62 
LpMDHh63 
LpMDHh64 



TGTGTATGAATGAGGCTTTTGTAGCTCTATTTTCGCCTGATGATTTAlCAGGACAGGATATTG 
TGTGTATGAATGAGGCTTTTGTAGCTCTATTTTCGCCTGATGATTTACAGGACAGGATATTG 
TGTGTATGAATGAGGCTTTTGTAGCTCTATTTTCGCCTGATGATTTACAGGACAGGATATTG 
iTGTGTATGAATGAGGCTTTTGTAGC^CTATTTTCGCCTGATGATTTACAGGACAGGATATTG 
TGTGTATGAgTGAGGCTTTTGTAGCTCTATTTTCGCCTGATGATTTACAGGACAGGATATTG 
TGTGTATGAATGAGGCTTTTGTAGCTCTATTTTCGCCTGATGATTTACAGGACAGGATATTG 

IgwaSgnIgcttttgtagctctattttcgcctgPIgatttaIcaggacaggata^ 



625 
607 
S71 
566 
275 
224 
55 




1320 



1340 



1360 



LpMDHhl 
LpMDHh2 
LpMDHh3 
LpiyiDHh4 
LpMDHhS 
LpMDHh6 
LpMDHh? 
LpMDHhS 
LpMDHh9 
I,pMDHhlO 
LpMDHhl 1 
LpMDHhl2 
LpMDHhl3 
LpMDHhl4 
lipMDHhlS 
LpMDHhie 
LpMDHhl? 
LpMDHhlS 
LpMDHhlS 
'LpMDHh20 
L>pMDHh21 
. IipMDHh22 
' LpMDHh23 
IipMDHh24 
LpMDHh25 
IipMDHh26 
IjpMDHh27 
XipMDHh28 
LpMDHh29 
LpMDHh30 
LpMDHh31 
LpMDHh32 
LpMDHhS 4 
LpMDHh35 
LpMDHh36 
LpMDHh37 
LpMDHhS 8 
LpMDHh39 
LpMDHh40 
LpMDHh41 
LpMDHh42 
.LpMDHh43 
'LpMDHh44 
LpMDHh45 
LpMDHh46 
LpMDHh47 
LpMDHh48 
LpMDHh49 
LpMDHhBO 
LpMDHhS 1 
LpMDHhS 2 
LpMDHhS3 
LpMDHh54 
LpMDHhS 5 
LpMDHhS 6 
LpMDHhS? 
LpMDHhS 8 
LpMDHhS 9 
LpMDHheO 
LpMDHhei 
LpMDHh62 
LpMDHh63 
LpMDHh64 



gcaggaagattnnaacaatttgacgtctgattaaaaccaacctcttalttat tccgigtgtgta l 

gcaggaagattggaacaatttgacgtctgattaaaacca 
gcaggaagattggaacaatttgacgtctgattaaaaccaacctcttajttattccl^^ 

gcaggaagattggtuvcaatttgacgtctgattaaaaccaacctcttattaBBBBBBBB 

gcaggaagattnnaapaatttgacgtctgattaaaaccaacctcttattat tcctgtgtgta 

gcaggaagattggaacaatttgacgtctga'§Saaaa§S!aaS: 



GCAGGAAGATTGGAACAATTTGACGTCTGATTAAAACCAACCTCTTABXATTCCTGTGTGTi 



687 
646 
633 
616 
337 
265 
116 




1380 



1400 



1420 



LpMDHhl 
IipMDHh2 
LpNDHh3 
LpMDHh4 
LpMDHhS 
LpMDHhe 
LpMDHh? 
LpMDHhS 
LpMDHhS 
LpMDHhl 0 
LpMDHhll 
LpMDHhl2 
LpMDHhia 
LpMDHhl4 
•LpMDHhl 5 
LpMDHhie 
LpMDHhl? 
LpMDHhlS 
LpMDHhld 
LpMDHh20 
LpMDHh21 



LpMDHh22 
LpMDHh23 
LpMDHh24 
LpMDHh25 
LpMDHh26 
LpMDHh27 
LpMDHh28 
LpMDHh29 
LpMDHh30 
LpMDHh31 
LpMDHh32 
LpMDHh34 
LpMDHhS 5 
LpMDHh36 
LpMDHh37 
LpMDHhS 8 
LpMDHh39 
LpMDHh40 
LpMDHh41 
LpMDHh42 
LpMDHh43 
LpMDHh44 
LpMDHh45 
LpMDHh46 
LpMDHh47 
LpMDHh48 
LpMDHh49 
LpMDHhS 0 
LpMDHhS 1 
LpMDHhS 2 
LpMDHh53 
LpMDHh54 
LpMDHhS 5 
LpMDHhS 6 
LpMDHhS? 
LpMDHhS 8 
LpMDHhS 9 
LpMDHh60 
LpMDHh61 
LpMDHh62 
LpMDHh63 
LpMDHh64 



TGAATGAGGCTTTTGTAGCTCTATTTTCGCCTGATGATTTACAGGCdATGATATTGGCAGG 



TGAATGAGGCTTTTGTAGCTCTATTTTCGCCTGATGATTTACAGGCCATGATATTGGCAGGA 



tgaatgaggcttttgtagctctattttcgcctgatgatttacaggtScatgatattggcagga 



TGAATGAGGCTTTTGTAGCTCTATTTTCGCCTGATGATTTACAGGCdAte'GATATTGGCAGGA 




1440 



1460 



1480 



LpMDHhl 
IipMDHh2 
LpMDHh3 
IipMDHh4 ' 
LpMDHhB 
LpMDHhg 
LpMDHh7 
LpMDHhS 
LpMDHh9 
LpMDHhlO 
LpMDHhll 
LpMDHhl2 
LpMDHhl3 
LpMDHhl4 
LpMDHhlS 
LpMDHhie 
LpMDHhl? 
LpMDHlllS 
LpMDHhl 9 
LpMDHh20 
LpMDHh2 1 
.LpMDHh22 
LpMDHh23 
IipMDHh24 
LpMDHh25 
ItpMDHh26 
LpMDHh27 
LpMDHh2 8 
IipMDHh29 
LpMDHh.30 
LpMDHh31 
LpMDHh32 
LpMDHh34 
LpMDHhS 5 
LpMDHhS 6 
LpMDHh37 
LpMDHh38 
LpMDHh39 
LpMDHh40 
LpMDHh41 
LpMDHh42 
LpMDHh43 
•LpMDHh44 
;:ipMDHh45 
LpMDHh46 
LpMDHh47 
LpMDHh48 
LpMDHh49 
LpMDHhBO 
LpMDHhS 1 
LpMDHhS 2 
LpMDHhS 3 
LpMDHh54 
LpMDHhSS 
LpMDHhS 6 
LpMDHhS? 
LpMDHhSS 
LpMDHhS 9 
LpMDHhS 0 
LpMDHh61 
LpMDHh62 
LpMDHh63 
LpMDHh64 



[ggattggaacaatttgacgcctgattajpj^ccaacctcttattactaKaaaaaaaB 

[GGATTGGAACAAi#MAj^^ 



^^^^re 3X Nucleotide sequence of LpMDHl 



* 20 * 40 * 60 

LpMDHi : GTNCATAAAGCTGCCCAAAGCAATNCGTGNAATATTATCAGTi^CCCTGTCAATTCTAC^ : 60 



* 80 * 100 ' * 120 
LpMDHi : GTACCAATTGCTGCTGAAGTATOTAAAAAAGCTGGQACATAC^ • 120 

* 140 * 160 ♦ 

LpMDHi : GGGGTTGACAACaTGTTNGATCaOTAima^^ . 17 7 



^^^^re 32 Deduced amino acid sequence of LpMDHi 



* 20 * 40 * 

LpMDHi : XHKAAQSlCKXNIISNPVlTSTV^IAAEVFKia^GTyi^KX^ : 58 




re 33 Nucleotide sequence of LpMDHj 



* 20 * 40 * 60 
IipMDHj : ANAAAGGAGCCJGACGCAGGGGCGCAGAATTCCATCTGCTNACTCrGCCyVCCACCC^ : 60 

* 80 . * 100 * 120* 
LpMDHj : GGACT^TGGCGTCAGCTGTTAaUVTCAGTTCAGTCAGCGCGa^GGCCGC^ : 120 

* 140 * 160 * . 180 
LpMDHj : ACaUlGGAACCATGGCAGCy^CQAGCTACAGTGGCCTAAAGGCATa^TC :.180 

* 200 * 220 * 240 
IipMDHj : CTTCGAATOlGGAACyVTCATTCCTGGGa^AGACCGCCTCCCTCCGGGCAACTGOT : 240 

* 260 * 280 * 300 
LpMDHj : AAGGGTTGTGCCAAAGGCGAAGTCTGGGTCGa^GATATOSCCTCAGGaVTCTO : 300 

V 

* 320 * 340 * 360 
LpMDHj : GGCGGTGCTTGGTGCTGCTGGTGGO^TCGGTOUICCACT : 360 

* 380 * 400 * 420 
LpMDHj : TCCTCTGGTCTCGGAGCTGCGCCTGTATGATATCGCGAATGTCAAGGGCGTCGCTGCAGA : 420 

* 440 * 460 * 480 
LpMDHj : TCTCAGCCaiCTGCTVACACGCerGCTCAGGTCATCG^ : 480 

* 500 * 520 * 540 
LpMDHj ! AGAGTGCTTGAAAGGTQTGGATGTTGTCGTCATCCCTGCGGGTGTCCCAAGG : 540 

* 560 * 580 * 600 
LpMDHj CATGACCCGTGATGACCTTTTTAACATlSrAATGCGGGAATCGNCAAGTCGCTTATTGA^ : 600 

' * 620 * 640 * 660 

LpMDHj : TGTTGCAGACAATTGCCCTX3AGGGCCTTATTC^ : 660 



LpMDHj : CCCT : 664 



^l^re 34 Deduced amino acid sequence of LpMDHj 



* 20 * -40 * • 60 

LpMDHj ! XRSMRGAEPHLXTLPPPKLDMASAVTISSVSAQAAIiVSKPRNHGSTSYSGLKASSSSIS : 60 



* 80 * 100 * . 120 

LpMDHj : PESGTSPLGKTASIiRATVTTRVVPKAKSGSQISPQASYKVAV^ : 120 



* 140 * 160 * 180 

LpMDHj : PLVSELRLYDIANViaSVAADLSHCaJTPAQVMDFTGPAELM : 180 



* 200 * 220 

LpMDHj : MTRDDLFNXNAGIXKSLIEAVADNCPEGLIHIINNPGQTPP : 221 



I 




re 35 Consensus contig nucleotide sec[uence of XjpMDHk 



* 20 * 40 * 60 

LpMDHk : TNTTTANCCCNCCyUVOTATCCAGNANCCACCTGGCCCTACACAI^^ 



60 



* 80 * 100 * 120 

LpMDHk : AACCAGNACGCAAGGGGCXSAGCCGGGGCGCACGCAGCTUVTTCCCaiTCTGCTC^ 



120 



* 140 * 160 * 180 

LpMDHk s AAQTTGGAGATGGCa^TO^GCTGTTACCSVTCAGCTCAGTCAGCGCGC^ 



180 



* 200 * 220 * .240 

LpMDHk : TCGAAACCAAGGAATCATGGCAGOVCAAGCTACa^GTGGCCTAAAGGCATCATCATC^ 



240 



* 260 * 280 * 300 

LpMDHk • : ATCAGCraCGAATCa^GGGACATCATTCCTGGGCAAGACC^ 



300 



* 320 * 340 * 360 

LpMDHk : ACCTa^GATTGTGCCAAAGGCAAAGTCTGGGTCTCAGATATCyVCCTC^^ 



360 



* 380 * 400 * 420 
LpMDHk : AAGGTGGCGGTGCTTGGTGCTGCCGGTGGCATCGGTCAACCACTGGGCCTGCTGATCAA6 : 420 

* 440 ^ 460 * 480 
LpMDHk : ATGTCTCCTCTXSGTCTCAGAGCTGaSCCTGTATGATATTGCaU^TGTC^ : 480 

* 500 . * 520 * 540 
LpMDHk : GCAGATCTCaiGC<»CrrGCAACACGCCTTCrrCAGGTCATGGACTO : 540 



* 560 .* 580 * 600 

LpMDHk : CTAGCTGACTGCTTGAAAGGTGTTGATGTTGTCGTCATCCCTGCGGGTGTCCCAAGGAAG 



600 



* 620 * 640 . * 660 

LpMDHk : CCAGGCATGACCCGTGATGACCTTTTTAACATCy^TGCGGGCaiTCGT 



660 



* 680 * 700 * 720 

LpMDHk : GAGGCTGTTGCAGAC^ACTGCCCTGAGGCCTTCATCCATATCATC^ 



720 



* 740 * 760 * 780 

LpMDHk : TCCACTGTGCCGATTGCTGCTGAGATTCTGAAACAGAAGGGCGTCTACAACCCC^ : 780 



* 800 * 820 * 840 

LpMDHk : CTCTTCGGGGTTTCCACCCTGGATGTTGTCAGAGCTAACA 



840 



* 860 * 880 • * 900 

LpMDHk : AACCTCaVGCCTO^TCGATGTTGATGTCCCAGTTGTCGGTGGCCATGCTGGGATCa^CGAT^ 



900 



* 920 * 940 ** -960 

LpMDHk : CTGCCTCTGTTGTCCAAGACTAGGCCTTCTGTOlGCTTCa^CGGACGAGGAAACTQ^ : 



960 




* 580. * 1000 * 1020 
Lp^MDHk : CTGACAAAGAGGATACAGAACGCTGGGACAGAGGCGGTGGAGGCGAAGGCTGGTGCTGGC : 1020 

* 1040 * 1060 * 1080 
LpMDHk : TCTGCTACTCTGTCCATGGCTTATGCCGCTGCa^GATTTGTTGAGT : 1080 

* 1100 * 1120 * 1140 
LpMDHk : ATGGCTGGTGATCCAGATGTTTACGAGTGCACGTATGTTCAGTCTGAGTTAACAGAGCTT : 1140 

* 1160 * 1180 * 1200 
LpMDHk I CCATTCTTCGCGTCCAQAGTTAAQCnTGGGAAGGACGGNGTTGAG : 1200 

* 1220 * 1240 * * 1260 
LpMDHk : GACCTGC^GGGAGTGACGGAGTACGAGGCCAAGGCGCTTGANGCAT^ : 1260 



LpMDHk : AAG : 1263 

) 



} 




e 36 Deduced amino acid sequence of LpMDHk 

* 20 * *0 * -60 

LpMDHk : XUCXQXSXXHIJUJHXXKTKXNQXARGEPGRTQQPPSAHQPKLEMMAVTISSVSAQAALV : 60 

80 * 100 * 120 



LpMDHk. I SKPRNHGSTSYSGLKASSSSISFESGTSPLGKTASLRATITSRIVPKAKSGSQISPQASY 



120 



* 140 * 160 *■ 

LpMDHk : KVAVLGT^GIGQPLGLLIKMSPLVSBLRLYDIANVKGVAADLSHCNTPSQVMDPTGPAB : 180 

* 200 * 220 ' * 240 
LpMDHk : LADCLKGVDVWIPAGVPRKPGMTRDDLBWINAOIVKSLIEAVADNCPEAFIHIISNPVN : 240 

* 260 * 280 • * 300 
LpMDHk : STVPIAAEILKQKGVyMPKKLPGVSTLDVVRANTFVAQKKin.SLIDVDVPWGGH?^I^ : 300 

\ * 320 * 340 * 360 

LpMDHk : LPLLSKTRPSVSFTDEBTEQLTKRIQNAOTEAVEAKAaMSATLSMAYAAARF^mSSLI^ : 360 

*■ 380 * 400 * 420 

.LpMDHk : MAGDPDVYECTYVQSELTELPFFASRVKLGKDXVESIISSDLEGVTBYEAKALXALKABL : 420 
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acid fragments contributing to the 



LpMDHkl : 
LpMDHk2 : 
LpMDHk3 ' : 
LpMDHk4 : 
LpMDHkS : 
LpMDHke : 
LpMDHk? : 
LpMDHkS 
LpMDHk9 : 
LpMDHkl 0 : 
LpMDHkll : 
LpMDHkl2 : 
L£^Hkl3 i 
LpMDKkl4 : 
LpMOHklS : 
LpMDHkl6 *: 
LpMDHkl? 
LpMDHkl 8 
LpMDHkl 9 
LpMDHk20 
) LpMDHk21 
LpMDHk22 
LpMDHk23 
LpMDHk24 
LpMDHk25 
LpMDHk26 
LpMDHk27 
LpMDHk28 
LpMDHk29 
LpMDHk30 
LpMDHkBl 
LpMDHkS 2 
LpMDHk33 
LpMDHk34 
LpMDHkS 5 
LpMDHkS 6 
LpMDHkS? 
LpMDHkS 8 
LpMDHkS 9 
LpMDHk40 
LpMDHk41 
LpMDHk42 
) LpMbHk4S 
LpMDHk44 
LpMDHk45 
LpMDHk46 
LpMDHk4? 
LpMDHk48 
LpMDHk49 
L£^Hk50 
LpMDHkS 1 

Lf^HkSS 
LpMDHk54 
LpMDHkS 5 
LpMDHkS 6 
LpMDHkS? 
LpMDHkS 8 
LpMDHkS 9 
LpMDHkeO 
LpMDHkei 
LpMDHk62 
LpMDHkeS 
LpMDHk64 
LpMDHk65 
LpMDHk66 
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LpMDHkl 
LpMDHk2 
LpMDHkS 
LpMDHk4 
IipMDHkS 
LpMDHk6 
LpMDHk? 
XipMDHkS 
LpMDHk9 
IipMDHklO 
LpMDHkll 
LpMDHkl2 
LpMDHkl3 
LpMDH}cl4 
LpMDHklB ' 
LpMDHlcl6 
LpMDHkl? 
LpMDHkie 
LpMDHklS 
LpMDHk20 
LpMDHk21 
LpMDHk22 
LpMDHk23 
Iij«1DHk24 
)LpMDHk25 
LpMDHk26 
LpMDKk27 
LpMDHk28 
LpMDHk29 
IipMDHk30 
LpMDHk31 
IipMDHk32 
IipMDHk33 
LpMDHk34 
LpMDHkBS 
IipMDHk36 
bpMDHk37 
I.pMDHk38 
LpMDHk39 
IipMDHk40 
LpMDHk41 
IipMDHk4'2 
IipMDHk43 
LpMDHk44 
LpMDHk45 
LpMDHk46 
,LpMDHk47 
'lipMDHk48 
LpMDHk49 
XipMDHkSO 
EipMDHkSl 
LpMDHk52 
LpMDHk53 
LpMDHk54 
LpMDHkSS 
LpMDHkSe 
LpMDHkS? 
LpMDHkBS 
LpMDHkS 9 
LpMDHk60 
LpMDHkei 
LpMDHk62 
LpMDHk63 
LpMDHk64 
LpMDHk65 
LpMDHk66 



agiccagnacgcaaggggcgagccggggcgcacgcagcaattcccatctgctcagcaaccc 
aHccagnaBgcJJaggggcgagccggggcgcacgcagcaattcccatctgctcagcaaccc 
ai^ccagwwcgcaaggggcgagccggggcgcacghagctvattcccatctgctcagcaaccc 
aHccagnaIgcBaggggcgagccggggcgcacgcagcaattcccatctgctcaccaaccc 
aj^ccagnabgcgaggggcgagccggggcgcacgcagcaattcccatctgctcaccaaccc 
ajgjccagnscgcaaggggcgagcclfgggcgcacgbagcaattcccatctgctcaccaaccc 
aRccagnaBgcHaggggcga^ccggggcgcacgcagcaattcccatctgctcaccaaccc 
i&cagnacgc|aggggcgagccggggcgcacgcagcaattcccatctgctcaccaaccc 
aBccagwaIgcaaggggcgagccggggcgcacgcagcaattcccatctgctcaccaaccc 
aIccagnaIgcHaggggcgagccggggcgcacgcagcaattcccatctgctcaccaaccc 

AgcgS'^BCGCAAGGGGCGAGCCgGGGCGCACGjlAGCAATTCCCATCTGCTCACCAACCC 

naccagnwHgcHaggggcgagccggggcgcacgcagcaattcccatctgctcac^caaccc 
l^accagnahgclaggggcgagccggggcgcacgcagcaattcccatctgctcaccaaccc 
agcia^jj^hgclaggggcgagccggggcgcacgcagcaattcccatctgctcacrcaaccc 

AHcCAGNAHGciAGGGGCGAGCCGGGGCGCACGCAGCAATTCCClTCTGCTCACCAACCC 

SaccagnaJBcaaggggcgagccggggcgcacgcagcaattcccatctgctcaccaaccc 

^'fcCAGNSlflcAAGGGGCGAGCCGGGGCGCACGCAGCAATTCCCATCTGCTCAicAACCC 

a6ccagBscgcaaggggcgagccggggcgcacgcagcaattcccatctgctcaccaaccc 
agccagnacgcaaggggcgagccggggcgcacgcagcaattcccatctgctcaccaaccc 
a[c^ccagnS^cgcaaggggcgagccggggcgcacgcagcaattcccatctgctcacj:caaccc 
a'&cagnwcgct^ggggcgagccggggcgcacgcagcaattcccatctgctcaccaaccc 
agccagnwcgcaaggggcgagccggggcgcacgcagcaattcccatctgctcaccaaccc 
aBccagnnHgcBaggggcgagccggggcgcacgcagcaattcccatctgctcaccaaccc 
abccagnkflgcbaggggcgagccggggcgcacgcagcaattcccatctgctcacpcaaccc 
aiccagnahgcaaggggcgagccggggcgcacgcagcaattcccatctgctcaecaaccc 
a^ccagnacgcaaggggcgagccggggcgcacgcagcaattcccatctgctcaccaaccc 
agccagflnjcgcaaggggcgagccggggcgcacgcagcaattcccatctgctcatcaaccci 
ai:^ccagnnpgcajvggggcgagccggggcgcacgcagc7u\ttcccatctgctcaccaaccc| 

SgCCHGN^flCAAGGGGCGAGCCGGGGCGCACGCAGCAATTCCCATCTGCTCACCAACCcl 

a|ccagisiacgcaaggggcgagccggggcgcacgcagcaattcccatctgctca(|:caacccl 
agccagnacgcaaggggcgagccggggcgcacgcagcaattcccatctgctcaccaaccci 
alccagnabgcaaggggcgagccggggcgcacgcagcaattcccatctgctcaccavaccci 
aBccagnacgcaaggggcgagccggggcgcacgcagcaattcccatctgctcaccaaccc 
a:gccaga|Bgcaaggggcgagccggggcgcacgcagcaattcccatctgctcaccaaccc 
^&cagij|acgcaaggggcgagccggggcgcacgbagcaattcccatctgctcaccaaccc 
aBccagnacgcaaggggcgagccggggcgcacgcagcaattcccatctgctcaccaaccc 

aBSJaB|aA;AALAAW 

AAiA/^fl|A&iV^.GGGGCGAGCCGGGGCGCACiCAGCAATTC^ 

^ rr^X'XX ^ — ^^^^ ^. — ^ r^^^^ ^^r^ TX^^ L^^ ^ 



AAg^flpAA^AAiAAgGGGC 



ahhwbgaj^a^aaggggcgagccggggcgcacgcagcaattcccatctgcgcaccaaccc 
a^ccaghnISgcaaggggcgagccggggcgcacgcagcaattcccatctgctcac^ 
§accagimacgca§ggggcgagccggggcgcacgcagcaattcccatctgctcaccaaccc 
aBccagnacgcaaggggcgagccggggcgcacgcagcaattcccatctgctcaccaaccc 

AJiJ^AiAflNAgiA§AAGGGGCGAGCCGGGGCGCACGCAGCAATTCCCATCTGC§CACCM 
NACCAGNACGCAgGGGGCGAGCCGGGGCGCACGCAGCAATTCCCATCTGCTCACCAJ\CCC 

AAijl^AB^AA^^AAg 

AA|^^g|!AA& 
AAK2i%&^^ 
AASsMAclklL^^ 

aa!S§ana!a!$^ 

AAgb^iAG^^lAS!?^^ 
HSpAB.^AjAJAf^^ 

^B||a|^na^5Laangggcgagccggggcgcacgcagca^ 

^Hjl^MaCAAGGGGCGAGCCGGGGCGCACGCAGCAATTCCCATCTGCTCAC^ 

^^^^^^H^^aggggcnsgccggggcgcHcgcBgBaattBccatctgB3c§ccBaBcc 
^^^h^^habhhhj|gagccggggcwchcgcagcaattcccatctgctc 
^^^^^^^^^^^BBBBgggggc gcacgcaBcaattcccatctgctcaccaaccc 
^^^^^^^^^^^^^^^hs^^k.cabggagcaattccgitnctgci^icaccaaccc 

^^^^^^^^^^^^^^^^^BBGNCigCNgCAgjTgcCCjS t^CTGCipCACCA^ 
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LpMDHkl 
IipMDHk2 
LpMDHk3 
LpMDHk4 
LpMDHkS 
I<pMDHk6 
LpMDHk? 
LpMDHkS 
LpMDHk9 
LpMDHkl 0 
' LpMDHkl 1 
LpMDHkl 2 
LpMDHkl3 
LpMDHkl4 ' 
LfiMDHklS 

L£^Hkl7 
LpMDHkl 8 
LpMDHkl? 
L^MDHk20 
LipMDHk21 
LpMDHk22 
LpMDHk23 
LpMDHk24 
]LpMDHk25 
LpMDHk26 
LpMDHk27 
i;pMDHk28 
LpMDHk29 
LpMDHk30 
' LpMDHkBl 
LpMDHk32 
.LpMDHk33 
' LpMDHk34 
LpMDHk35 
LpMDHk36 
LpMDHk37 
LpMDHk38 
LpMDHk39 
LpMDHk40 
LpMDHk41 
L:pMDHk42 
LpMDHk43 
IjpMpHk44 
lipMbHk45 
i;pMDHk46 
.IipMDHk47 
'lipMDHk48 
LpMDHk49 
LpMDHkSO 
LpMDHkSl 
LpMDHk52 
LpMDHk53 
LpMDHk54 
LpMDHkSS 
LpMDHk56 
LpMDHk57 
LpMDHk58 
LpMDHk59 
LpMDHk60 
LpMDHk61 
LpMDHk62 
LpMDHk63 
LpMDHk64 
LpMDHk65 
LpMDHk66 
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IipMDHkl 
]JpMDHk2 
LpMDHkB 
LpMDHk4 
LprODHkS 
LpMDHk6 
LpMDHk? 
LpMDHkd 
LpMDHk9 ' 
LpMDHklO 
LpMDHkll 
LpMDHkl2 
LpMDHkia 
IjpMDHkl4 
LpMDHklS 
LJsMDHkie 
Xi£^MDHkl7 
LpMDHklS 
Lc>MDHkl9 
LpMDHk20 
IipMDHk21 
LpMDHk22 
LpMDHk23 
LpMDHk24 
)I,pMDHk25' 
Lt>MDHk26 
LpMDHk27 
LpMDHk28 
LpMDHk29 
LpMDHkaO 
LpMDHkai 
lipMDH3c32 
IipMDHk33 
LpMDHk34 
XjpMDHk35 
IjpMDHk36 
LpMDHk37 
IjpMDHk38 
LpMDHk39 
IipMDHk40 
IjpMDHk41 
IipMDHk42 
IipMDHk43 
LpMDHk44 
•LpMDHk45 
IipMDHk46 



I,pMDHk47 
XjpMDHk48 
IipMDHk49 
LpMDHkSO 
LpMDHkSl 
LpMDHk52 
IipMDHk53 
LpMDHk54 
Ij£^Hk55 
IjpMDHk56 
IipMDHkS? 
LpMDHkSe 
LpMDHk59 
Lp]yiDHk60 
IipMDHk61 
IjpNaDHk62 
LpMDHk63 
LpMDHk64 
XjpMDHk65 
LpMDHk66 
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LpMDHkl 

LpMDHk2 

LpMDHk3 

LpMDHk4 

LpMDHkS 

L|>MDHk6 

LpMDHk? 

LpMDHkS 

LpMDHk9 

LpMDHkl 0 

LpMDHkl 1 

LpMDHkl2 

LpMDHkl3 

LpMDHkl4 

LpMDHkl 5 

LpMDHkl 6 

LpMDHkl? 

LpMDHklS 

LpMDHkl9 

LpMDHk20 

LpMDHk21 

LpMDHk22 

LpMDHk23 

];ipMDHk24 

) LpMDHk2S 
LpMDHk26 
LpMDHk27 
LpMDHk28 
LpMDHk29 
LpMDHk30 
LpMDHk31 
LpMDHk32 
LpMDHkS 3 
LpMDHkS 4 
LpMDHk35 
LpMDHkS 6 
LpMDHkS 7 

' LpMDHkS a 
LpMDHkS 9 
3^^4DHk40 
' LpMDHk41 
LpMDKk42 
LpMDHk43 
LpMDHk44 
LpMDHk45 
LpMDHk46 



LpMDHk47 
LpMDHk48 
LpMDHk49 
LpMDHkSO 
LpMDHkBl 
LpMDHkS 2 
LpMDHk53 
LpMDHk54 
LpMDHkS 5 
LpMDHkS 6 
LpMDHkS? 
LpMDHkSS 
LpMDHk59 
LpMDHkeO 
LpMDHk61 
LpMDHk62 
LpMDHk63 
LpMDHk64 
LpMDHkeS 
LpMDHk66 



ATCAGCTTCGAATCAGGGACATCATTCCTGGGC7^GACC;;^CCTCTCTTCGGGCGACTATC 

ATCAGCTTCGAATCAGGGACATCATTCCTGGGCAAGACCGCCTCTCTTCGGGCGACTATC 

ATCAGCTTj§GAATCAGGGACATC§TTCCTGGGCAAGAc5sGCCTC§CT|gCGGGCGACTgTr| 

ATCAGCTTCGAATCAGGGACATCATTCCTGGGCAAGACCGCCTCTCTTCGGGCGACTATC 

ATCAGCTTCGAATCAGGGACATCATTCCTGGGCAAGACCGCCTCTCTTCGGGCGACTATC 

ATCAGCTTgGAATCAGGGACATCGTTCCTGGGCAAGACgGCCTCgCTgCGGGCGACTgTg 

ATCAGCTTCGAATCAGGGACATCATTCCTGGGCAAGACCGCCTCTCTTCGGGCGACTATC 

ATCAGCTTCGAATCAGGGACATCATTCCTGGGCAAGACCGCCTCTCTTCGGGCGACTATC 

ATCAGCTTCGAATCAGGGACATCATTCCTGGGCAAGACCGCCTCTCTTCGGGCCpACTATC 

ATCAGCTTCGAATCAGGGACATCATTCCTGGGCAAGACCGCCTCTCTTCGGGCGACTATC 

ATCAGCTT§GAATCAGGGACATCgTTCCTGGGCAAGACa]GNCTC§CT^CGGGC(5ACTgTg 

ATCAGCTTCGAATCAGGGACATCATTCCTGGGCAAGACCGCCTCTCTTCGGGCGACTATC 

ATCAGCTTCGAATCANGGACATCATTCCTGGGCAAGACCGCCTCTCTTCGGGCGACTATC 

ATCAGCTTCGAATCAGGGACATCATTCCTGGGCAAGACCGCCTCTCTTCGGGCGACTATC 

ATCAGCTTCGAATCAGGGACATCATTCCTGGGCAAGACCGCCTCTCTTCGGGCGACTATC 

ATCAGCTTCGAATCAGGGACATCATTCCTGGGCAAGACCGCCTCTCTTCGGGCGACTATC 

ATCAGCTTCGAATCAGGGACATCATTCCTGGGCAAGACCGCCTCTCTTCGGGCGACTATC 

ATCAGCTTCGAATCAGGGACATCATTCCTGGGCAAGACCGCCTCTCTTCGGGCGACTATC 

ATCAGCTTCG?y\TCAGGGACATCATTCCTGGGCAAGACCGCCTCTCTTCGGGCGACTATC 

ATCAGCTTCGAATCAGGGACATCATTCCTGGGCAAGACCGCCTCTCTTCGGGCGACTATC 

ATCAGCTTCGAATCAGGGACATCATTCCTGGGCAAGACCGCCTCTCTTCGGGCGACTATC 

ATCAGCTTCGAATCAGGGACATCATTCCTGGGCAAGACCGCCTCTCTTCGGGCGACTATC 

ATCAGCTTCGAATCAGGGACATCATTCCTGGGCAAGACCGCCTCTCTTCGGGCGACTATC 

ATCAGCTTCGAATCAGGGACATCATTCCTGGGCAAGACCGCCTCTCTTCGGGCGACTATC 

ATCAGCTTCGAATCAGGGACATCATTCCTGGGCAAGACCGCCTCTCTTCGGGCGACTATC 

ATCAGCTTCGAATCAGGGACATCATTCCTGGGCAJAGACCGCCTCTCTTCGGGCGACTATC 

ATCAGCTTCGAATCAGGGACATCATTCCTGGGCAAGACCGCCTCTCTTCGGGCGACTATC 

ATCAGCTTCGAATCAGGGACATCATTCCTGGGCAAGACCGCCTCTCTTCGGGCGACTATC 

ATCAGCTTCGAATCAGGGACATCATTCCTGGGCAAGACCGCCTCTCTTCGGGCGACTATC 

ATCAGCTTCGAATCAGGGACATCATTCCTGGGCAAGACCGCCTCTCTTCGGGCGACTATC 

ATCAGCTTCGAATCAGGGACATCATTCCTGGGCAAGACCGCCTCTCTTCGGGCGACTATC 

ATCAGCTTCGAATCAGGGACATCATTCCTGGGCAAGACCGCCTCTCTTCGGGCpACTATC 

ATCAGCTTCGAATCAGGGACATCATTCCTGGGCAAGACCGCCTCTCTTCGGGCGACTATC 

ATCAGCTTCGAATCAGGGACATCATTCCTGGGCAAGACCGCCTCTCTTCGGGCGACTATC 

ATCAGCTT;fpAATCAGGGACATCGTTCCTGGGCAAGAC@GCCTC§CT.§CGGGCGACTGT"p 

ATCAGCTTCGAATCAGGGACATCATTCCTGGGCAAGACCGCCTCTCTTCGGGcjsACTATC 

ATCAGCTTCGAATC^GGSACATCATTCCTGGGCAAGAC[fGCCTCTCTTCGGGCGpjCAqTC 

ATCAGCTTCGAATCjl^GGl^kcATCATTCCTGGGCAAGACfijGCCTCTCTTCGGGC 

ATCAGCTTCGAATCAGGGACATCATTCCTGGGCAAGACCGCCTCTCTTCGGGCGACTATC 

ATCAGCTTCGAATC[fGGSACATCATTCCTGGGCAAGACffiGCCTCTCTTCGGGcbpiCA^ 

ATCAGCTTCGAATCifGGHACATCATTCCTGGGCAAGACjljGCCTCTCTTCGGGcjsjij^^^ 

atcagcttcgaatc(rgg^catcattcctgggcaagacjggcctctcttcgggcg3c7^^^ 
atcagcttcgaatcagggacatcattcctgggcaagaccgcctctcttcgggcpactatc 
atcagcttcgaatcagggacatcattcctgggcaagaccgcctctcttcgggcgactatc 
atcagcttcgaatcagggacatcattcctgggcaagaccgcctctcttcgggc'gactatc 

ATCAGCTTCGAATCjljGGgACATCATTCCTGGGCAAGACgGCCTCTCTTCGGGCGgc^TC 

ATCAGCTTCGAATCAGGGACATCATTCCTGGGCAAGACCGCCTCTCTTCGGGcbACTATC 

ATCAGCTTCGAATC[rGGpACATCATTCCTGGGCAAGAC[|GCCTCTCTTCGGGcbpCA^TC 

ATCAGCTTCGAATCJGGl^CATCATTCCTGGGCAAGAc|iGCCTCTCTTCGGGCp 

ATCAGCTTCGAATc|rGG!A5CATCATTCCTGGGCAAGACg^GCCTCTCTTCGGGCppCA^^ 

ATCAGCTTCGAATCffiGGl^CATCATTCCTGGGCAAGACfiGCCTCTCTTCGGGCGTCAS.TC 



atcagcttcgaatcJtggI/^catcattcctgggcaagacigcctctcttcgggcctca^ 

ATCAGCTTCGAATCfEGG*%CATCATTCCTGGGC7^GACTGCCTCTC 

ATCAGCTTCGAATC{l-|GG5kcATCATTCCTGGGCAAGACtGCCTCTCTTCGGGCGm^ 

ATCAGCTTCGAATCflGGAkcATCATTCCTGGGCAAGAClIGCCTCTCTTCGGGCGflcKCT 

ATCAGCTTCGAATCAGGGiCATCATTCCTGGgCAAGACCGNCTCTCTTCGGGGGACTATC 

ATCAGCTTCGAATCgGGSACATCATTCCTGGGCAAGACgGCCTCTCTTCGGGGGgCAGTC 

ATCAGCTTCGAATCAGGGACATCATTCCTGGGCAAGACCGCCTCTCTTCGGGGGACTATC 

ATCAGCTTCGAATCAGGGACATCATTCCTGGGCAAGACCGCCTCTCTTCGGGOGACTATC 

ATCAGCTTCGAATCp|GGpACATCATTCCTGGGCAAGAC[?GCCTCTCTTCGGGCGmCAGTC 

ATCAGCTTCGAATCgGGi^CATCATTCCTGGGCAAGACjf^GCCTCTCTTCGGGdGT^^^ 

ATCAGCTTCGAATCjtiGGlftlACATCATTCCTGGGCAAGACli GCCTCTCTTCGGGGGP ^ 
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IjpMDHkl 
XipMDHk2 
XipMDHk3 
LpMDH)c4 . 
LpMDHkS * 
IipMDHkS 
IjpMDHkV 
LpMDHka 
IipMDHk9 
LpMDHklO 
LpMDHkll 
IipMDHkl2 
IipMDHkia 
IipMDHkl4 
LpMOHklB 
IipMDHkl6 
LpMDHkl7 
LpMDHklS 
LpMDHklS 
LpMDHk20 
LpMDHk21 
IjpMDHk22 
IipMDHk23 
LpMDHk24 
)IjpMDHk25 
IipMDHk26 
LpMDHk27 
Iip^B:)Hk28 
IipMDHk29 
XipKDHk30 
LlM)Hk31 
IipMDHk32 
XipMDHk33 
LpNDHk34 
IipMDHk35 
IipMDHk36 
LpMDHk37 
XjpMDHk38 
IipMDHk39 
IipMDHk40 
I>pMDHk41 
i;ipMDHk42 
LpMDHk43 
LpMDHk44 
iipMDHk45 
IipMDHk46 
,IipMDHk47 
'lipMDHk48 
lipMDHk49 
IipMX2Hk50 
LpMDHkSl 
IipMDHk52 
I*pMDHk53 
I«pMDHk54 
LpMDHk55 
LpMDHkSe 
LpMDHkS? 
LpMDHk58 
LpiyiDHk59 
XipMDHkSO 
LpMDHkei 
LpMDHk62 
LpMDHk63 
LpMDHk64 
LrpMDHkeS 
XipMDHk66 



ACCTCAAGGATTGTGCCAAAGGCAAAGTCTGGGTCTCAGATATCACCTCAGGCGTCGTAC 
ACCTCAAGGATTGTGCCAAAGGCAAAGTCTGGGTCTCAGATATCACCTCAGGCGTCGTAC 
ACCgCAAGGATTGTGCCAAAGGCAAAGTCTGGGTCTCAGATATCgCCTCAGGCSTCaiTAC 
ACCTCAAGGATTGTGCCAAAGGCAAAGTCTGGGTCTCAGATATCACCTCAGGCcj'TCGTAC 
ACCTCAAGGATTGTGCCA/^GGCAAAGTCTGGGTCTCAGATATCACCTCAGGCGTCGTAC 
ACCgCAAGGATTGNGCCA7^GGCAAAGTCTGGGTCTC?J|ATATCgCCTCAGGC^TC2jTAC 
ACCTCAAGGATTGTGCCAAAGGCAAAGTCTGGGTCTCAGATATCACCTCAGGCGTCGTAC 
GCTCAAGGATTGTGCCAAAGGCAAAGiCTGGGTCTCAGATATCACCTCAGGCGTCGTAC 
.CCTCAAGGATTGTGCCAAAGGCAT^GTCTGGGTCTCAGATATCACCTCAGGCGTCGTAC 
.CCTCAAGGATTGTGCCAAAGGCAAAGTCTGGGTCTCAGATATCACCTCAGGCGTCGTAC 
.CC§CAAGGATTGTGCCAAAGGCAAAGTCTGGGTCTCAGATATCgCCTCAGGCi|TCgTAC 
.CCTCAAGGATTGTGCCAAAGGC7Wi.GTCTGGGTCTCAGATATCACCTCAGGCGTCGTAC 
.CCTCAAGGATTGTGCCAAAGGCAAAGTCTGGGTCTCAGATATCACCTCAGGCGTCGTAC 
.CCTCAAGGATTGTGCCAAAGGCAAAGTCTGGGTCTCAGATATCACCTCAGGCGTCGTAC 
.CCTCAAGGATTGTGCCAAAGGCAAAGTCTGGGTCTCAGATATCACCTCAGGCGTCGTAC 
.CCTCAAGGATTGTGCCAAAGGCTW^GTCTGGGTCTCAGATATCACCTCAGGCGTCGTAC 
lCCTCAAGGATTGTGCCAAAGGCAAAGTCTGGGTCTCAGATATCACCTCAGGCGTCGTAC 
LCCTCAAGGATTGTGCCAAAGGCAAAGTCTGGGTCTCAGATATCACCTCAGGC<i:TCGTAC 
iCCTCAAGGATTGTGCCAAAGGCAAAGTCTGGGTCTCAGATATCACCTCAGGciTCGTAC 
iCCTCAAGGATTGTGCCAAAGGCAAAGTCTGGGTCTCAGATATCACCTCAGGCGTCGTAC 
iCCTCAAGGATTGTGCCAAAGGCAAAGTCTGGGTCTCAGATATCACCTCAGGCCfTCGTAC 
LCCTCAAGGATTGTGCCAAAGGCTU^GTCTGGGTCTCAGATATCACCTCAGGCCpTCGTAC 
^CCTCAAGGATTGTGCCAAAGGCAAAGTCTGGGTCTCAGATATCACCTCAGGCCfTCGTAC 
^CCTCAAGGATTGTGCCAAAGGCT^AAGTCTGGGTCTCAGATATCACCTCAGGCCTCGTAC 
iwCCTCAAGGATTGTGCCAAAGGCAAAGTCTGGGTCTCAGATATCACCTCAGGCCTCGTAC 
!.CCTCAAGGATTGTGCCAAAGGCAAAGTCTGGGTCTCAGATATCACCTCAGGC(i:TCGTAC 
vCCTCAAGGATTGTGCCAAAGGCAAAGTCTGGGTCTCAGATATCACCTCAGGC(pTCGTAC 
VCCTCAAGGATTGTGCCAAAGGCAAAGTCTGGGTCTCAGATATCACCTCAGGCGTCGTAC 
^CCTCAAGGATTGTGCCATVAGGCAAAGTCTGGGTCTCAGATATCACCTCAGGCGTCGTAC 
^CCTCAAGGATTGTGCCAAAGGCAAAGTCTGGGTCTCAGATATCACCTCAGGCGTCGTAC 
^CCTCAAGGATTGTGCCTW^GGCAAAGTCTGGGTCTCAGATATCACCTCAGGCCTCGTAC 
^CCTCAAGGATTGTGCCAAAGGCAAAGTCTGGGTCTCAGATATCACCTCAGGCCTCGTAC 

i^cctcaaggattgtgccaaaggcaaagtctgggtctcagatatcacctcaggcctcgtac 
^cctcaaggattgtgccaaaggcaaagtctgggtctcagatatcacctcaggcctcgtac 
vccgcaaggattgtgccaaaggcaaagtctgggtctcagatatcgcctcaggc^tc^tac 
\cctcaaggattgtgccaaaggcaaagtctgggtctcagatatcacctcaggcctcgtac 
\ccBcgaggattgtgccaaaggcaaagtctgggtctcagatatcgcctcaggc^tc^Sta 

\CCj:jCGAGGATTGTGCCAAAGGCAAAGTCTGGGTCTCAGATATCg'cCTCAGGCyTC[dTAC 

\cctcaaggattgtgccaaaggcaaagtctgggtctcagatatcacctcaggcctcgtac 
\cc^c^aggattgtgccaaaggcaaagtctgggtctcagatatcgcctcaggc'|!tcp.tac 

ACcbcGAGGATTGTGCCAAAGGCAAAGTCTGGGTCTCAGATATcjG^ 

ACCgCgAGGATTGTGCCAAAGGCAAAGTCTGGGTCTCAGATATCgCCTCAGGC^TC^TAC 
ACCTCAAGGATTGTGCCAAAGGCAAAGTCTGGGTCTCAGATATCACCTCAGGCCTCGTAC 
ACCTCAAGGATTGTGCCAAAGGCAAAGTCTGGGTCTCAGATATCACCTCAGGCCTCGTAC 
ACCTCAAGGATTGTGCCAAAGGCAAAGTCTGGGTCTCAGATATCACCTCAGGCCTCGTAC 
ACCgcgAGGATTGTGCCAAAGGCAAAGTCTGGGTCTCAGATATC§CCTCAGGc|h'CgTAC 
ACCTCAAGGATTGTGCCA-AAGGCAAAGTCTGGGTCTCAGATATCACCTCAGGCCTCGTAC 
ACCSc^AGGATTGTGCCAAAGGC7Li\AGTCTGGGTCTCAGATATCGCCTCAGGcSTCpTAC| 
ACCQC&AGGATTGTGCCAAAGGCAAAGTCTGGGTCTCAGATATC^ScCTCAGGCp^ 
ACCdcl^AGGATTGTGCCAAAGGCAAAGTCTGGGTCTCAGATATCGCC^ 
ACc|c&SGGATTGTGCCAAAGGCAAAGTCTGGGTCTCAGATATCj3CCTCAGGCA|TCr^^ 
ACCGCpAGGATTGTGCCAAAGGC7\AAGTCTGGGTCTCAGATATCj^CCTCAGGCATc[^^^ 
|ACCCC^AGGATTGTGCCAAAGGCAAAGTCTGGGTCTCAGATATC(b^^ 
|ACCgC©AGGATTGTGCCAAAGGCAAAGTCTGGGTCTCAGATATC^GCCTCAGGC^^ 
|ACCGCGAGGATTGTGCCAAAGGCAAAGTCTGGGTCTCAGATATCC3CCTCAGGC|^^ 
ACCjSCj3AGGATTGTGCCAAAGGCAAAGTCTGGGTCTCAGATATCj3^CCTCAGGC'^^ 
ACCyCgAGGATTGTGCCAAAGGCAAAGTCTGGGTCTCAGATATCgCCTCAGGcliTqijTAC 
ACCTCAAGGATTGTGCCAAAGGCAAAGgCTGGGTCTCAGATATCACCTCAGGCCTCGTAC 
CCgC§AGGATTGTGCCAAAGGCAAAGTCTGGGTCTCAGATATCgCCTCAGGCSTC§TAC 
ACCTCAAGGATTGTGCCAAAGGCAAAGTCTGGGTCTCAGATATCACCTCAGGCCTCGTAC 
ACCTCAAGGATTGTGCCAAAGGCAAAGTCTGGGTCTCAGATATCACCgCAGGcbTCGTAC 
ACC§CGAGGATTGTGCCAAAGGC7^AAGTCTGGGTCTCAGATATCpCCTCAGGC?[TC^|TAC 
ACc'&&iGGATTGTGCCAAAGGCAAAGTCTGGGTCTCAGATATCgCCTCAGGCpi^ 
ACC<|c(3AGGATTGTGCCAAAGGCAAAGTCTGGGTCTCAGATATCSCCTCAGGC^TCil^ 
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AAGGTGGCGGTGCTTGGTGCTGgCGGTGGCATCGGTCAACCACTGGGCCTGCTGATCAAG 

f^GGTGGCGGTGCTTGGTGCTGCCGGTGGCATCGGTCAACCACTGGGCCTGCTGATCAAG 

AAGGTGGCGGTGCTTGGTGCTGC@GGTGGCATCGG;GCAACCACTGGGCCTGCT(pATCAAG 

AAGGTGGCGGTGCTTGGTGCTGCCGGTGGCATCGGTCAACCACTGGGCCTGCTGATCAAG 

AAGGTGGCGGTGCTTGGTGCTGCCGGTGGCATCGGTCAACCACTGGGCCTGCTGATCAAG 

AAGGWGGCGGTGCTTGGTGCTGCgjGGTGGCATCGGTCAACCACTGGGCCTGCTGATCAAg 

AAGGTGGCGGTGCTTGGTGCTGCCGGTGGCATCGGTCAACCACTGGGCCTGCTGATCAAG 

AAGGTGGCGGTGCTTGGTGCTGCCGGTGGCATCGGTCAACCACTGGGCCTGCTGATCAAG 

AAGGTGGCGGTGCTTGGTGCTGCCGGTGGCATCGGTCAACCACTGGGCCTGCTGATCAAG 

AAGGTGGCGGTGCTTGGTGCTGCCGGTGGCATCGGTCAACCACTGGGCCTGCTGATCAAG 

AAGGTGGCGGTGCTTGGTGCTGCgGGTGGCATCGGTCAACCACTGGGCCTGCTGATCAAG 

AAGGTGGCGGTGCTTGGTGCTGCCGGTGGCATCGGTCAACCACTGGGCCTGCTCpATCAAG 

AAGGTGGCGGTGCTTGGTGCTGCCGGTGGCATCGGTCAACCACTGGGCCTGCT<pATCAAG 

AAGGTGGCGGTGCTTGGTGCTGCCGGTGGCATCGGTCAACCACTGGGCCTGCTGATCAAG 

AAGGTGGCGGTGCTTGGTGCTGCCGGTGGCATCGGTCAACCACTGGGCCTGCTGATCAAG 

AAGGTGGCGGTGCTTGGTGCTGCCGGTGGCATCGGTCAACCACTGGGCCTGCTGATCAAG 

AAGGTGGCGGTGCTTGGTGCTGCCGGTGGCATCGGTCAACCACTGGGCCTGCTGATCAAG 

AAGGTGGCGGTGCTTGGTGCTGCCGGTGGCATCGGTCAACCACTGGGCCTGCTGATCAAG 

AAGGTGGCGGTGCTTGGTGCTGCCGGTGGCATCGGTCAACCACTGGGCCTGCT(pATCAAG 

AAGGTGGCGGTGCTTGGTGCTGCCGGTGGCATCGGTCAACCACTGGGCCTGCTGATCAAG 

AAGGTGGCGGTGCTTGGTGCTGCCGGTGGCATCGGTCAACCACTGGGCCTGCTGATCMG 

AAGGTGGCGGTGCTTGGTGCTGCCGGTGGCATCGGTCAACCACTGGGCCTGCT(j5ATCAAG 

AAGGTGGCGGTGCTTGGTGCTGCCGGTGGCATCGGTCAACCACTGGGCCTGCTGATCAAG 

AAGGTGGCGGTGCTTGGTGCTGCCGGTGGCATCGGTCAACCACTGGGCCTGCTGATCAAC 

AAGGTGGCGGTGgTTGGTGCTGCCGGTGGCATCGGTCAACCACTGGGCCTGCTGATCAAC 

AAGGTGGCGGTGCTTGGTGCTGCCGGTGGCATCGGTCAACCACTGGGCCTGCTGATCAAG 

AAGGTGGCGGTGCTTGGTGCTGCCGGTGGCATCGGTCAACCACTGGGCCTGCTGAGCAAC 

AAGGTGGCGGTGCTTGGTGCTGCCGGTGGCATCGGTCAACCACTGGGCCTGCTGATCAAC 

AAGGTGGCGGTGCTTGGTGCTGCCGGTGGCATCGGTCAACCACTGGGCCTGCTGATCAAC 

AAGGTGGCGGTGCTTGGTGCTGCCGGTGGCATCGGTCAACCACTGGGCCTGCTGATCAAC 

AAGGTGGCGGTGCTTGGTGCTGCCGGTGGCATCGGTCAACCACTGGGCCTGCTGATCAAC 

AAGGTGGCGGTGCTTGGTGCTGCCGGTGGCATCGGTCAACCACTGGGCCTGCTGATCAAC 

AAGGTGGCGGTGCTTGGTGCTGCCGGTGGCATCGGTCAACCACTGGGCCTGCTGATCAAC 

AAGGTGGCGGTGCTTGGTGCTGCCGGTGGCATCGGTCAACCACTGGGCCTGCTGATCAAC 

AAGGTGGCGGTGCTTGGTGCTGCgGGTGGCATCGGTCAACCACTGGGCCTGCTGATCAAC 

AJ\GGTGGCGGTGCTTGGTGCTGCCGGTGGCATCGGTCAACCACTGGGCCTGCT(pATCAAC 

AAGGTGGCGGTGCTTGGTGCTGCCGGTGGCATCGGTCAACCACTGGGCCTGCTGATCAAC 

AAGGTGGCGGTGCTTGGTGCTGCCGGTGGCATCGGTCAACCACTGGGCCTGCTGATCAAC 

.AAGGTGGCGGTGCTTGGTGCTGCCGGTGGCATCGGTCAACCACTGGGCCTGCTGATCAAC 

AAGGTGGCGGTGCTTGGTGCTGCCGGTGGCATCGGTCAACCACTGGGCCTGCTGATCAAC 

AAGGTGGCGGTGCTTGGTGCTGCCGGTGGCATCGGTCAACCACTGGGCCTGCTGATCAAC 

AAGGTGG|gGGTGCTTGGTGCTGC|GGTGGCATCGGTCAACCACTGGGCCTGCTGATCAAC 

AAGGTGGCGGTGCTTGGTGCTGCCGGTGGCATCGGTCAACCACTGGGCCTGCTCpATCAAC 

AAGGTGGCGGTGCTTGGTGCTGCCGGTGGCATCGGTC75JVCCACTGGGCCTGCTGATCAJVC 

AAGGTGGCGGTGCTTGGTGCTGCCGGTGGCATCGGTCAACCACTGGGCCTGCT(pATCAAC 

AAGGTGGCGGTGCTTGGTGCTGCCGGTGGCATCGGTCAACCACTGGGCCTGCTGATCAAC 

AAGGTGGCGGTGCTTGGTGCTGCCGGTGGCATCGGTCAACCACTGGGCCTGCTGATCAAC 

AAGGTGGCGGTGCTTGGTGCTGCCGGTGGCATCGGTCAACCACTGGGCCTGCTGATCAAC 

AAGGTGGCGGTGCTTGGTGCTGCCGGTGGCATCGGT|§AACCACTGGGCCTGCTGATCAA( 

AAGGTGGCGGTGCTTGGTGCTGCCGGTGGCATCGGTCAACCACTGGGCCTGCTGATCAAG 

AAGGTGGCGGTGCTTGGTGCTGCCGGTGGCATCGGTCAACCACTGGGCCTGCTGATCAAG 

AAGGTGGCGGTGCTTGGTGCTGCCGGTGGCATCGGTCAACCACTGGGCCTGCT(pATCAAG 

AAGGTGGCGGTGCTTGGTGCTGCCGGTGGCATCGGTCAACCACTGGGCCTGCTGATCAAG 

AAGGTGGCGGTGCTTGGTGCTGCCGGTGGCATCGGTCAACCACTGGGCCTGCTGATCAAG 

AAGGTGGCGGTGCTTGGTGCTGCCGGTGGCATCGGTCAACCACTGGGCCTGCTGATCAAG 

AAGGTGGCGGTGCTTGGTGCTGCCGGTGGCATCGGTCAACCACTGGGCCTGCTGATCAAG 

AAGGTGGCGGTGCTTGGTGCTGCCGGTGGCATCGGTCAACCACTGGGCCTGCTGATCAAG 

AAGGTGGCGGTGCTTGGTGCTGCCGGTGGCATCGGTCAACCACTGGGCCTGCTGATCAAG 

AAGGTGGCGGTGCTTGGTGCTGC[|GGTGGCATCGGTCAACCACTGGGCCTGCTGATCAAG 

AAGGTGGCGGTGCTTGGTGCTGCCGGTGGCATCGGTCAACCACTGGGCCTGCTGATCAAG 

aaggtggcggtgcttggtgctgccggtggcatcggtcaaccactgggcctgctgatcaag 

aaggtggcggi>7gcttggtgctg|cggi|ggcatngg'|caaccactgggcctgct(batg 

aaggtggcggtgcttggtgctgccggtggcatcggtcaaccactgggcctgctgatcaag 

AAGGTGGCGGTGCTTGGTGCTGCCGGTGGCATCGGTCAACCACTGGGCCTGCTGATCAAG 

I^HBSBSSHHttggtgctgcSggtggcatcggtcaaccactgggcctgctgatcaag 
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MGTCTCCTCTGGTCTCAGAGCTGCGCCTGTATGATATTGgCAATGTCAAGGG;^GTCGCT 
ATGTCTCCTCTGGTCTCAGAGCTGCGCCTGTATGATATTGCCAATGTCAAGGG^GTCGCT 
ATGTCTCCTCTgGTCTCAGAGCTGCGCCTGTATGATATTGCCAATGTCAAGGG|GTCGCT 

atgtctcctctggtctcagagctgcgcctgtatgatattgccaatgtcaaggg^gtcgct 
atgtctcctctggtctcagagctgcgcctgtatgatattg ccaatgtcaaggg;^gtcgct 

^TGTCTccTCTGGTCTc^amaa^gaHtBHaMBna 1 1 1 1 1 1 1 1 1 1 

.tgtctcctctggtctcagagctgcgcctgtatgatattgccaatgtcaaggg;c^gtcgct 
.tgtctcctctggtctcagagctgcgcctgtatgatattgccaatgtcaaggg^gtcgct 
ltgtctcctctggtctcagagctgcgcctgtatgatattgccaatgtcaagggic^gtcgct 

lTGTCTCCTCTGGTCTCAGAGCTGCGCCTGTATGATATTGCCAATGTCAAGGG^GTCGCT 
iTGTCTCCTCTGGTCTCAGAGCTGCGCCTGTATGATATTGCCAATGTCAAGGGgGTCGCT 
iTGTCTCCTCTGGTCTCAGAGCTGCGCCTGTATGATATTGCCAATGTCAAGGG^GTCGCT 
ITGTCTCCTCTGGTCTCAGAGCTGCGCCTGTATGATATTGCCAATGTCAAGGG^GTCGCT 
IlTGTCTCCTCTGGTCTCAGAGCTGCGCCTGTATGATATTGCCAATGTCAAGGGAGTCGCT 
^TGTCTCCTCTGGTCTCAGAGCTGCGCCTGTATGATATTGCCAATGTCAAGGGAGTCGCT 
^TGTCTCCTCTGGTCTCAGAGCTGCGCCTGTATGATATTGCCAATGTCAAGGGAGTCGCT 
^TGTCTCCTCTGGTCTCAGAGCTGCGCCTGTATGATATTGCCAATGTCAAGGGAGTCGCT 
SlTGTCTCCTCTGGTCTCAGAGCTGCGCCTGTATGATATTGCCAATGTCAAGGGAGTCGCT 

^tgtctcctctggtctcagagctgcgcctgtatgatattgccaatgtcaagggagtcgct 
\tgtctcctctggtctcagagctgcgcctgtatgatattgccaatgtcaagggagtcgct 
i.tgtctcctctggtctcagagctgcgcctgtatgatattgcc7latgtcaagggagtcgct 
\tgtctcctctggtctcagagctgcgcctgtatgatattgccaatgtcaagggagtcgct 

\TGTCTCCTCTGGTCTCAGAGCTGCGCCTGTATGATATTGCCAATGTCAAGGGAGTCGCT 

\tgtctcctctggtctcagagctgcgcctgtatgatattgccaatgtcaagggagtcgct 
\tgtctcctctggtctcagagctgcgcctgtatgatattgccaatgtcaagggagtcgct 
a,tgtctcctctggtctcagagctgcgcctgtatgatattgccaatgtcaagggagtcgct 
\tgtctcctctggtctcagagctgcgcctgtatgatattgccaatgtcaagggagtcgct 

^TGTCTCCTCTGGTCTCAGAGCTGCGCCTGTATGATATTGCCAATGTCAAGGGAGTCGCT 

atgtctcctctggtctcagaSctgcgcctgtatgatattgccaatgtcaagggagtcgct 

ATGTCTCCTCTGGTCTCAGAGCTGCGCCTGTATGATATTGCCAATGTCAAGGGAGTCGCT 

atgtctcctctggtctcagagctgcgcctgtatgatattgccaatgtcaagggagtcgct 
atgtctcctctggtctcagagctgcgcctgtatgatattgccaatgtcaagggagtcgct 
atgtctcctctggtctcagagctgcgcctgtatgatattgccaatgtcaagggagtcgct 
atgtctcctctggtctcagagctgcgcctgtatgatattgccaatgtcaagggagtcgct 
atgtctcctctggtctcaS(agctgcgcctgtatgatattgccaatgtcaagggggtcgct 

ATGTCTCCTCTGGTCTCAGAGCTGCGCCTGTATGATATTGCCAATGTCAAGGGAGTCGCT 
ATGTCScCTCTGGTCTC&iGAGCTGCGCCTGTATGATATTGCgAATGTCAAGGGpGTCGCT 
ATGTCGCCTCTGG^CTCgGAGCTGCGCCTGTATGATATTGC^^AATGTCAAGGGgGTCGCT 

Iatgtctcctctggtctcagagctgcgcctgtatgatattgcc^latgtcaagggagtcgct 
|atgtcScctctggtctcg!gagctgcgcctgtatgatattgc^aatgtcaagggSgtcgct 
^tgtcgcctctggtctcggagctgcgcctgtatgatattgcgaatgtcaaggg6gtcgct 
atgtctcctctggtctcagagctgcgcctgtatgatattgccaatgtcaagggSgtcgct 
atgtctcctctggtctcagagctgcgcctgtatgatattgccaatgtcaagggagtcgct 
atgtctcctctggtctcagagctgcgcctgtatgatattgccaatgtcaagggagtcgct 

ATGTCTCCTCTGGTCTCAGAGCTGCGCCTGTATGATATTGCCAATGTCAAGGGAGTCGCT 
ATGTCgCCTCTGGTCTCgGAGCTGCGCCTGTATGATATTGCgAATGTCAAGGGpGTCGCT 
ATGTCTCCTCTGGTCTCAGAGCTGCGCCTGTATGATATTGCCAATGTCAAGGGAGTCGCT 
ATGTCGCCTCTGGTCTCGGAGCTGCGCCTGTATGATATTGCpAATGTCAAGGG|GTCGCT 
ATGTCQCCTCTGGTCTcjGGAGCTGCGCCTGTATGATATTGCj^AATGTCAAGGGeGTCGCT 
ATGTCGCCTCTGGTCTCGGAGCTGCGCCTGTATGATATTGC^AATGTCAAGGGSGTCGCT 
ATGTCGCCTCTGGTCTCGGAGCTGCGCCTGTATGATATTGC<iAATGTCAAGGG|GTCGCT 

atgtcgcctctggtctcggagctgcgcctgtatgatattgcIgaatgtcaagggggtcgct 
atgtcljcctctggtctc'&agctgcgcctgtatgatattgcgaatgtcaagggggtcgct 

ATGTcldlcCTCTGGTCTCC^GAGCTGCGCCTGTATGATATTGCjG^ 

ATGTC(icCTCTGGTCTc'GGAGCTGCGGCTGTATGATATTGC.GAATGTCAAGGG^c|^ 

ATGTC(^CCTCTGGTCTc|GGAGCTGCGCCTGTATGATATTGCjGAATGTCAAGGG& 

ATGTc|qCCTCTGGTCTCg.GAGCTGCGCC§GTATGATAgTGCgAATGTC 

ATGgNTCCTCTGGTCTCAJ^AGCTGCGCcfGTATGATATTGCCAATGWCAAGGGAGTCGCT 

atgtctcctctggtctcagagctgcgcctgtatgatattgccaatgtcaagggdgtcgct 
atgtctcctctggtctcagagctgcgcctgtatgatattgccaatgtcaagggSgtcgct 
atgtctcctctggtctcagagctgcgcctgtatgatattgccaatgt caagggIa gtcgct 

atgBBBBB 

ATGTCi3CCTCTGGTCTC'GGAGCTGCGCCTGTATGATATTGCi3AATGTCAAGGG|GGTCGCT 
ATGTCGCCTCTGGTCTCGGAGCTGCGCCTGTATGATATTGCgAATGTCAAGGGgGTCGCT 
ATGTCTCCTCTg'GTCTCgGAGCTGCGCCTGTATGATATlGCCAATGTCAAGGGAGTCGCT 
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GCAGATCTCAGWCACTGCAACACGCCTTCTCAGGTCATGGACTTCACTGGCCCAGCAGM 
GCAGATCTCAGCCACTGCAACACGCCTTCTCAGGTCATGGACTTCACTGGCCCAGCAGAA. 
GCAGATCTgAGCCACTGCAACACGCCTTCTCAGGTCATGGACTTCACTGGCCCilGCpGM 
GCAGATCTCAGCCACTGCAACACGCCTTCTCAGGTCATGGACTTCACTGGCCCAGCAGAA 
GCAGATCTCAGCCACTGCAACACGCCTTCTCAGGTCATGGACTTCACTGGCCcjGCAGAA 



GCAGATCTCAGCCACTGgAACACGCCTTCTCAGGTCATGGACTTgACTGGCCCAGCAGAA 
GCAGATCTCAGCCACTGCAACACGCCTTCTCAGGTCATGGACTTCACTGGCCCAGCAGAA 
GCAGATCTCAGCCACTGCAACACGCCTTCTCAGGTCATGGACTTCACTGGCCCAGCAGAA 
GCAGATCTCAGCCACTGCAACACGCCTTCTCAGGTCATGGACTTCACTGGCCCAGCAGAA 
GCAGATCTgAGCCACTGCAACACGCCTTCTCAGGTCATGGACTTCACTGGCCCj^GCgGAA 
GCAGATCTCAGCCACTGCAACACGCCTTCTCAGGTCATGGACTTCACTGGCCCAGCAGAA 
GCAGATCTCAGCCACTGCAACACGCCTTCTCAGGTCATGGACTTCACTGGCCCAGCAGAA 
GCAGATCTCAGCCACTGCAACACGCCTTCTCAGGTCATGGACTTCACTGGCCCAGCAGA^. 
GCAGATCTCAGCCACTGCAACACGCCTTCTCAGGTCATGGACTTCACTGGCCCAGCAGAA 
GCAGATCTCAGCCACTGCAACACGCCTTCTCAGGTCATGGACTTCACTGGCCCAGCAGAA 
GCAGATCTCAGCCACTGCAACACGCCTTCTCAGGTCATGGACTTCACTGGCCCAGCAGAA 
GCAGATCTCAGCCACTGCAACACGCCTTCTCAGGTCATGGACTTCACTGGCCCAGCAGAA 
GCAGATCTCAGCCACTGCAACACGCCTTCTCAGGTCATGGACTTCACTGGCCCAGCgGAJ^ 
GCAGATCTCAGCCACTGCAACACGCCTTCTCAGGTCATGGACTTCACTGGCCCAGCAGAA 

gcagatctcagccactgcaacacgccttctcaggtcatggacttcactggcccagcaga;^, 

GCAGATCTCAGCCACTGCAACACGCCTTCTCAGGTCATGGACTTCACTGGCCCAGCAGAA 

gcagatctcagccactgcaacacgccttctcaggtcatggacttcactggcccagcagaa 
gcagatctcagccgctgcaacacgccttctcaggtcatggacttcactggcccagcagaa 

GCAGATCTCAGCCACTGCAACACGCCTTCTCAGGTCATGGACTTCACTGGCCciGCAGAA 
GCAGATCTCAGCCACTGCAACACGCCTTCTCAGGTCATGGACTTCACTGGCCCAGCAGAA 
GCAGgTCTCAGCCACTGCAACACGCCTTCTCAGGTCATGGACTTCACTGGgcCAGCAGAA 
GCAGATCTCAGCCACTGCAACACGCCTTCTCAGGTCATGGACTTCACTGGCCCAGCAGAA 
GCAGATCTCAGCCACTGCAACACGCCTTCTCAGGTCATGGACTTCpCTGGCCCAGCAGAA 
GCAGATCTCAGCCACTGCAACACGCCTTCTCAGGTCATGGACTTCACTGGCCCAGCAGAA 
GCAGATCTCAGCCACTGCAACACGCCTTCTCAGGTCATGGACTTCACTGGCCCAGCAGAA 
GCAGATCTCAGCCACTGCAACACGCCTTCTCAGGTCATGGACTTCACTGGCCCAGCAGAA 
GCAGATCTCAGCCACTGC;>J\CACGCCTTCTCAGGTCATGGACTTCACTGGCCCAGCAGAA 
GCAGATCTCAGCCACTGCAACACGCCTTCTCAGGTCATGGACTTCACTGGCCCAGCAGAA 
GCAGATCTgAGCCACTGCAACACGCCTTCTCAGGTCATGGACTTCACTGGCCCi^GCgGAA 

gcagatctcagccactgcaacacgccttctcaggtcatggacttcactggcccagcagaa 
gc^gatctcagccac;ggcaacacgcct.Sctcaggtcatggacttcactggcccggc§gaa 

GC£GATCTCAGCCACTGCAA.CACGCCTGCTCAGGTCATGGACTTCACTGGCCCgGq&-M 
GCAGATCTCAGCCACTGCAACACGCCTTCTCAGGTCATGGACTTCACTGGCCC^GCAGAA 
GCgGATCTCAGCCACTGCAACACGCCT§CTCAGGTCATGGACTTCACTGGCCC(pGCi8GAA 
GC^GATCTCAGCCACTGCAACACGCCTGCTCgGGTCATGGACTTCACTGGCCcklGCGGAA 
GCAGATCT.5)AGCCACTGCAACACGCCTTCTCAGGTCATGGACTTCACTGGCCC(HgCGGAA 
GCAGATCTCAGCCACTGCAACACGCCTTCTCAGGTCATGGACTTCACTGGCCCAGCAG7\A 

gcagatctcagccactgctvacacgccttctcaggtcatggacttcactggcccagcagaa 
gcagatctcagccactgcaacacgccttctcaggtcatgggcttcactggcccigcagaa 

GCgGATCTCAGCCACTGCAACACGCCTgCTCAGGTCATGGACTTCACTGGCCC^GCgG-AA 

gcagatctcagccactgcaacacgccttctcaggtcatggacttcactggcccagcagaa 
gcgga^-ctcagccactgcaacacgcctgctcaggtcatggacttcactggcccggcSgaa 

GC,qGATCTCAGCCACTGCAACACGCCT|cTCAGGTCATGGACTTCACTGGCCcHsGc!GGAA 

GC.65GATCTCAGCCACTGCAACACGCCT(3CTCAGQTCATGGACTTCACTGGCCC©GC*GGA^. 

GCgGATCTCAGCCACTGCAACACGCCT.qCTCAGGTCATGGACTTCACTGGCCCffiGC'GGAA 

GCGjGATCTCAGCCACTGCAACACGCCTGCTCAGGTCATGGACTTCACTGGCCCffiGCGSGAA 

GC^-GATCTCAGCCACTGCAACACGCCTqCTCAGGTCATGGACTTCACTGGCCC(&GC(3GAA 

GC<:;:GATCTCAGCCACTGCAACACGCCTGCTCAGGTCATGGACTTCACTGGCCC(ifccGGAA 

GCj<^GATCTCAGCCACTGCAACACGCCT(|cTCAGGTCATGGACTTCACTGGCCCqGC£5GAA 

GCjGGATCTCAGCCACTGCAACACGCCTPjCTCAGGTCATGGACTTCACTGGCCCfSGCGGAA 

GCgGATCTCAGCCACTGCAACACGCCr^pTGAGGTCATGGACTTCACTGGCCC;|GCa^ 

GCAgATCTCA|rNCACTGCAACACGCCTTCraAGG|CATGGACTTCACTGG|CcS^3c'£^ 

GCAGATCTgAGCCACTGCAACACGCCTTCTCAGGTCATGGACTTCACTGGCCCCE^GCgGAZ^ 

GCAGATCTCAGCCACTGCAACACGCCTTCTCAGGTCATGGACTTCACTGGCCCAGCAGAA 

GCAGATCTCAGCCACTGCAACACGCCTTCTCAGGTCATGGACTTCACTGGCCCAGCAGAA 



GCgGATCTCAGCCACTGCAACACGCCTGCTCAGGTCATGGACTTCACTGGCCCj^GCiGGAA 
GCgGATCTCAGCCACTGCAACACGCCTGCTCAGGTCATGGACTTCACTGGCCCGGc'GGAA 

gcagatctcagccactgcaacacgcct|ctcaggPcatggacttcactggccc|gcggm 
bbb§^;£cagccflctgcaacacgcctgctcagg;^catggacttcactggcccg^ 
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CTAGCTGACTGCTTGAAAGGTGTTGATGTTGTCGgCATCCCTGCGGGTGTNCCAAG^SSS 

^^P^'^^^cttg^ggtgttgatgttgtcgtcatccctgcgggtgtcccJJgg^g 



ctagctgactgcttg. 
ctagctgactgcttg, 



VGGTGSTGATGTTGgCglgCATCCCTGCGGNITOTCWCAAGG 
-iGGTGTTGATGTTGTCGTCATCCCTGCGGGTGTCCCAAGGAAG 



^3^^°^:°^=^=^^^°^ggtgttgatgttgtcgtcatccctgcgggtgtcccAJgg^^ 

^T^o^^^'''^^^^^^°'^''°™^'^°^'^°'^^°'rCGTCATCCCTGCGGGTGTSctSGG^^ 
^TAGCTGACTGCTTGAAAGGTGTTGATGTTGTCGTCATCCCTGCGGGTGTcicSGGMG 



^r;? "^^^ ^'^"^r^'^^ -^^^ TGATGTTGTCGTCATCCCTGCGGGTGTCCC^^ 

TAGCTGACTGCTTGAiAGGTGTTGATGTTGTCGTCATCCCTGCGSTC?c?cKGG^G 
TAGCTGACTGCTTGAAAGGTGTTGATGTTGTCGTCATCCCTGCGGG^CTCCciJG^ 
™°"°^'^^°^T'^<2'^°GTGTTGATGTTGTCGTCATCCCTGCGGGiG?rcctjGGSG 
TAGCTGACTGCTTGAAA.GGTGTTGATGTTGTCGTCATCCCTGCGGGTGTCCct^GGj^G 
^^°^^°''^^^'=^^°^'^°^°^T«^TGTTGTCGTCATCCCTGCGGGTGTCCctJGG5JG 

:™°^^°^^'^°^^t°'^°gtgttgatgttgtcgtcatccctgcgggtgSccKgg^g 
tagctgactgcttgaaaggtgttgatgttgtcgtcatccctgcgggtctIc^Kgg^g 
tagctgactgcttgaaaggtgttgatgttgtcgtcatccctgcgS?ctcc?I^SJJg 
tagctgactgcttgaaaggtgttgatgttgtcgtcatccctgcSgtctcccJ^ggSS 
tagctgactgcttgaaaggtgttgatgttgtcgtcatccctgcgggtctcSJ^ggJJg 

.^^^^oSS'''^^*^^^^'^'^^°°'^°^^SGATGTTGTCGTCATCCCTGCGGGTGTCCCAAGGAAG 

^^^^^^''St^cttgaaaggtgttgatgttgtcgtcatccctgcgggtgtcccKSg 

^^°^^°^^^°'=^^°^-^G°^'3T^GATGTTGTCGTCATCCCTGCGSTSiccSGGjjG 
^^^^fe-^°=^^°^°#^^^'^^°TT«T^'^BMATCCCTGCGGGTSi?c£^G^ 



'^^^T-^^^'^^^^^^T^^^^TTGTCGT^ATCCCTGCGGGTO^ 

n^^SSI^^S^^*=^'^°'^°°§°^§='^T°TTGTCGTCATCCCTGCGGGTGTCCcWAAG 

^™^^P^?T°CTTGAAAGGTGTTGATGTTGTCGTCATCCCTGCGGGTOTCCcKS 

P^^^^I^-''-^°^''''°'^^^S<=^i°^^°TT«TCGTCATCCCTGCGGGTCTS?^^^ 

^3^^^^|2^S^'^^^^°^GG,eGT!3GATGTTGTCGTCATCCCTGCGGGTGTCCCAAGGAAG 

CTAGC^feAgTGCTTGAAAGG(^GTeGATGTTGTCGTCATCCCTGCGGGTCTrccW 

™Sf^'^°^^T°^G#TiGATGTTGTCGTCATCCCTGCGSTCTcSf^GGSG 

"^^^I^^^^^^^TGAAAGGGGTiG'GATGTTGTCGTCATCCCTGCGGG^GTC 

o^^^l^''°''''^'"''°'^°#^'^=^T°T^«^C°^'=^TCCCTGCGGGTGTC?cW 

CTAGC^GAGTGCTTGAAAGGGGTbGATGTTGTCGTCATCCCTGCGGGTGTCCckJSM^ 

^^J^^^^^J^^^^^^^gGTCcS^ 

ctagctgactgcttgaaaggtgttgatgttgtcgtcatccctgcgggtgtcccKagg^g 
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LpMDHkS 1 : 
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LpMDHkS 5 : 
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CCAG^CATGACCCGTGATGACCTTTTTAA.CATCAATGCGGGCATCGNWAAGTCGCTTATT 
CCAGGCATGACCCGTGATGACCTTTTTAACATCAATGCGGGCATCGTCAAGTCGCTTATT 
CCgiGGCATGACgjCGTGATGACCTTTTTAACATCAATGCGGGCATCGTCAAGTCGCTTATg 
CCAGGCATGACCCGTGATGACCTTTTTAACATCT^TGCGGGCATCGTCAAGTCGCTTATT 
CCAGGCA^GACCCGTGATGACCTTTTTAACATCAATGCGGGCATCGTCAAGTCqCTTATT 



CCAGGCATGACCCGTGATGACCTTTTTAACATCAATGCGGGCATCGTCAAGTCGCTTATT 
CCAGGCATGACCCGTGATGACCTTTTTAACATCAATGCGGGCATCGTCAAGTCGCTTATT 
CCAGGCATGACCCGTGATGACCTTTTTAACATCAATGCGGGCATCGTCAAGTCGCTTATT 
CCgGGCATGACgCGTGATGACCTTTTTAACATCAATGCGGGCATCGTCAAGTCGCTTATl: 
CCAGGCATGACCCGTGATGACCTTTTTAACATCAATGCGGGCATCGTCAAGTCGCTTATT 
CCAGGCATGACCCGTGATGACCTTTTTAACATCAATGCGGGCATCGTCAAGTCGCTTATT 
CCAGGCATGACCCGTGATGACCTTTTTAACATCAATGCGGGCATCGTCT^GTCGCTTATT 
CCAGGCATGACCCGTGATGACCTTTTTAACATCAATGCGGGCATCGTCAAGTCGCTTATT 
CCAGGCATGACCCGTGATGACCTTTTTAACATCAATGCGGGCATCGTCTIAGTCGCTTATT 
CCAGGCATGACCCGTGATGACCTTTTTAACATCAATGCGGGCATCGTCMGTCGCTTATT 
CCAGGCATGACCCGTGATGACCTTTTTAACATCAATGCGGGCATCGTCAAGTCGCTTATT 
CCAGGCATGACCCGTGATGACCTTTTTAACATCAATGCGGGCATCGWCAAGTCGCTTATT 
CCAGGCATGACCCGTGATGACCTTTTTAACATCAATGCGGGCATCGTCAAGTCGCTTATT 
CCAGGCATGACCCGTGATGACCTTTTTAACATCAATGCGGGCATCGTCAAGTCGCTTATT 
CCAGGCATGACCCGTGATGACCTTTTTAACATCAATGCGGGCATCGTCAAGTCGCTTATT 
CCAGGCATGACCCgXGATGACCTTTTTAACATCAATGCGGGCATCGTCAAGTCGCTTATT 
CCAGGCATGACCCGTGATGACCTTTTTAACATCAATGCGGGCATCGTCAAGTCGCTTATT 
CCAGGCATGACCCGTGATGACCTTTTTAACATC?iATGCGGGCATCGTCAAGTCGCTTATT 
CCAGGCATGACCCGTGATGACCTTTTTAACATCAATGCGGGCATCGTCAAGTcdcTTATT 
CCAGGCATGACCCGTGATGACCTTTTTAACATCAATGCGGGCATCGTCMGTCC^CTTATT 
CCAGGCATGACCCGTGATGACCTTTTTAACATCAATGCGGGCATCGTCAAGTCchcTTATT 
CCAGGCATGACCCGTGATGACCTTTTTAACATCAATGCGGGCATCGTCAAGTCGCTTATT 
CCAGGCATGACCCGTGATGACCTTTTTAACATCAATGCGGGCATCGTCAAGTCGCTTATT 
CCAGGCATGACCCGTGATGACCTTTTTAACATCAATGCGGGCATCGTCAAGTCGCTTATT 
CCAGGCATGACCCGTGATGACCTTTTTA^CATCAATGCGGGCATCGTCAAGTCGCTTATT 
CCAGGCATGACCCGTGATGACCTTTTTAACATCAATGCGGGCATCGTCAAGTCGCTTATT 
CCAGGCATGACCCGTGATGACCTTTTTAACATCAATGCGGGCATCGTCAAGTCGCTTATT 
CCg|GGCATGACgCGTGATGACCTTTTTAACATCAATGCGGGCATCGTCAAGTC(SCTTATg 
CCAGGCATGACCCGTGATGACCTTTTTAACATCAATGCGGGCATCGTCAAGTCGCTTATT 
CCAGGCATGACCCGTGATGACCTTTTTAACATCAATGCGGBCATCGTCBAGNNGCTTATg 
CCAGGCATGACCCGTGATGACCTTTgTAACATCAATGCGGGCATCGKCAAGNCGCTTATlt 
CCAGGCATGACCCGTGATGACCTTTTTAACATCAATGCGGGCATCGTCAAGTC(bcTTATT 

ccaggcatgacccgtgatgacctttttaacatcaatgcgggcatcgtcaagtcgcttatS 

CCAGGCATGACCCGTGATGACCTTTTTAACATCAATGCGGGCATCGTCAAGTC(!3CTTAT!d: 
CCgjGGCA'TGACgCGTGATGACCT'ITTTAACATCAATGCGGGCATCGTCAAGTCGCTTAT^ 
CCAG|CAfecffiGCC|TGAgG'GC|TgJ^^T:^^ 

CCAGGCATGACCCGTGATGACCTTTTTAACATCAATGCGGGCATCGTCAAGTCG^^ 

CCAGGCATGACCCGTGATGACCTTTTTAACATCAATGCGGGCATCGTCAAGTCGCTTATT 

CCAGGCATGACCCGTGATGACCTTTTTT^CATCAATGCGGGCATCGTCAAGTCGCTTATp 

CCAGGCATGACCCGTGATGACCTTTTTAACATCAATGCGGGCATCGTCAAGTCGCTTATT 

CCAGGCATGACCCGTGATGACCTTTTTAACATCAATGCGGGCATCGTCAAGTCGCTTATg 

CCAGGCATGACCCGTGATGACCTTTTTAACATCAATGCGGGCATCGTCAAGTCGCTTATli: 

ccaggcatgacccgtgatgacctttttaacatcaatgcgggcatcgtcaagtcgcttatI 
ccaggcatgacccgtgatgacctttttaacatcaatgcgggcatcgtcaagtcgcttatI 

CCAGGCATGACCCGTGATGACCTTTTTAACATCAATGCGGGCATCGTCAAGTCGCTTAT'<r 
CCAGGCATGACCCGTGATGACCTTTTT7iACATCAATGCGGGCATCGTCAAGTCGCTTAT| 
CCAGGCATGACCCGTGATGACCTTTTTAACATCAATGCGGGCATCGTCAAGTcbcTTATG 
CCAGGCATGACCCGTGATGACCTTTTTAACATCAATGCGGGCATCGTC^iAGTCGCTTAT^C 
CCAGGCATGACCCGTGATGACCTTTTTAACATCAATGCGGGCATCGTCAAGTCGCTTATp: 
CCAGGCATGACCCGTGATGACCTTTTTAACATCpATGCGGGCATCGTCAAGTCGCTTAT'C 



CCgtGGCATGAC|ljCGTGATGACCTTTTTAACATCAATGCGGGCATCGgCAAGTCGCTTAT;(r 
CCAGGCATGACCCGTGATGACCTTTTTAACATCAATGCGGGCATCGTCAAGTCGCTTATT 
CCAGGCATGACCCGTGATGACCTTTTTAACATCAATGCGGGCATCGTCAAGTcbcTTATT 



CCAGGCATGACGCGTGATGACCTTTTTAACATCAATGCGGGCATCGTCAAGTCGCTTATG 
CCAGGCATGACCCGTGATGACCTTTTTAACATCAATGCGGGCATCGTCAAGTCGCTTAT^^ 
CCgjGGCATGACSCGTGATGACCTTTTTAACATCAATGCGGGCATCGTCAAGTCGCTTATT 
CCiliGGCATGAC[ijcGTGATGACCTTTTTAACATCAATGCGGGCATCGTCAAGTc!GCTTATT 
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[TGCAGACAACTGCCCTGAGGCC 



GAGGCTGTTGCAGACAACTGCCC^ 
GAGnrTriTTnr ar: a n 7\ 
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l'GAGGCCTTCATCCATATCATCAGCAACCCGGTCAAC 
rGAGGCCTTCATCnATATrZiTr'ar::/-^ 7xr'rn^^r.m/-,.v ^ ^ 



5»GGCTGip>GCAG«CAACTGCCCroAOGCCTTC»TC™TATCA?SSJJr^^^^ 



3AGGCTGTTGCAGAC 



;AGGCTGTTGCAGACAACTGCCCTGAGGCCTTCATrrAT^Tn.™o^S^^AS!S^S 



;AGGCTGTTGCBBB^^^B^^^^^^^M^MiwMaSSSlSSfl 
^AGGCTGTTGCAGACAACTGCCcSS^^^^^^^^^ 

iAGGCTGTTGCAGACAACTGCCCTGAGGCCTTCAT^^^^^^^^^^^^ 

;AGGCTGWTGCAGACAACTGCCCTGAGGCCTTCA™rATA;^^^r^S^^^S^!2^S^ 



;A™S^a1aC^C;gC^^ 

lAGGCTGTTGCAGACAACTGCCCTGAGGCCTTCATCCATATCATCAGC^JcCcISc^r 
lAGGCTGTTGCAGACAACTGCCCTGAGGCCTTCATCCATATSSSi^^cSrTr^r 

Ia^^G^^^IS^^^^^ 

AGGCTGTTGCAGACAACTGCCCTGAGGCCTTCATCCATATS?CAG?^S3GTC^r 

;aggctgttgcagacaactgccctW5gccttcatccatatcatcag^^^^^S 

Iag^^^S^^^^^^^ 

i^^^^^^^^^'^^^^^CTGCCCTGAGGCCTTCATCCATATajATCAGCAk^^^S 

:^°°^™2;t'5cagacaactgccctgaggccttcatccatatCtcag?^cS£tcSc 

AGGCTGNTGCAGACAACTGCCCTGAGGCCTTCATCCATAT,^kTCAGCAACCr^^T^^ 

-aggctgttgcagacaactgccctgaggccttcatccatatS?SS^cc?ES 
^gaggctgttgcagacaactgccctgaggccttcatccatatIatSg^I^ccSStc^^ 
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|LpMDHk47 
LpMDHk48 
LpMDHk49 
L|^Hk50 
LpMDHkSl 
LpMDHk52 
LpMDHk53 
LpMDHk54 
LpMDHkSB 
LpMDHkS 6 
LpMDHkS 7 
LpMDHkS 8 
LpMDHkS 9 
LpMDHk60 
LpMDHkSl 
LpMDHk62 
LpMDHk63 
LpMDHk64 
LpMDHkeS 
LjE^Hkee : 



740 



760 



780 



'^g^^^^G'^ GCCGATTGCTGCTGAGATTCTGAAACAGAAGGG CGTCTACAACCCCAAGAAC 
TMCACTGTGA 



™CACTGTGCCGATTGCTGCTGA 
TCCACTGTGCCGATTGCTGCTGA ^ 

TCCACgGTG CCGATTGCTGCTGAGATTCTG AAACAGAAGGGCGTNTACAACCrraAno^ 

Ti^^cACTGTGflpjBmmjmj^"" — 

TNCACTGTGCCGATTGCTGCTGAG 



tccactgtgccgattgctgctgagatJ 

TWCACTGTGCCGATTGCTGCTGAGAT^ 



TCCACTGTGCCGATTGCTGCTGAGATTCTGAAACAGAAGGGCGTCTACAACCCtAAGAAG 

tccactgwgccgattgctgctganattctgaaacahaagggcg&T5tacaacccc^ 

TCCACTGTGCCGATTGCTGCTGAGATTCTGAAACAGAAGGGCGTCTACAACCcbAAGAAGl 

tccactgtgccgattgctgctgagattctgaaacagaagggcgtctacaaccccaagaag 
tccactgtgccgattgctgctgagattctgaaacagaagggcgtctacaaccc!caagaagI 



TCCACTGTGCCGATTGCTGCT i 

TgCACTGTGCCGATTGCTGCTGAGATTCTGAAAN 

tccactgtgccgattgctgctgagattctgaaacagaagggcgtctacaaccc-caagaag 

TCCACTGTGC CGATTGCTGCTGAGATTCTGAA ACAGAAGGGCGTCTACAAPrr^Aannzir: 
TgCACTGTGCE^ 
MCACTGTGCCGATTGg 
T^lcACTGTGCCGATTGCTG 




TN^CACGGTGCCGAT 
TCCACGjGTGCCGATTGCTGC^eGAGA 
TCCACGGTGCCGATTGCTGCTGAGA 



TWCACTGTGCCGATTi 

TWCACTGTGCCGATTGCTGCTGtc 



CTGCiGAGATTCTG 



GGGCGTCTACAACCCtAAGAAG 



TgCACSGTGCCGATN 

TCCACdGTGCCGATOGCTGCiA'GAGATTCTGAAACAGAAGGGCGTCTACAACCC'CAAGAAG 
TCCAC6GTGCCGATTGCTGOA'GAGATTCTGAAACAGAg, 

TCCACfSGTGCCGATTGCTGC'^GAGATTCTGAAACAGAAGGGCGTCTACAACCC'CAAGAAG 
TWCACGGTGCCGATTGCTGC&AGATTCTGAAACAl 
TCCACdGHGCCGATTG^TGci^ASATTiliTG^i 



TCCACTGTGCCGATTGCTGCTGAGATTCTGAAACAGAAGGGCGTCTACAACCCCAAGAAG 
TjgCACTGTGCCGATTGCTGCTGAGATTCTGAAACAGAAGGGCGTCTACAScCCCAAGAAG 



T 

TWCACgGTGCCGATTGCTGCgGAGATTCTGAAACAGAAGGGCGTCTACAACCC'C _ 

TCCACTGTGCCGATTGCTGCTGAGATTCTGAAACAGAAGGGriGTCTACAACCc'cAAGAAG 
££CACTGTGCCGATTGCTGCTGAGATTCTGAM 



680 
726 
702 
723 
707 
687 

642 

702 
69S 

630 
671 
407 
•294 




LpMDHkl 
LpMDHk2 
IipMDH)c3 
LpMDHk4 
LpMDHkS 
LpMDHke 
IipMDHky 
LpMDHkS 
lipMDHk9 
LpMDHklO 
LpMDHkll 
LpMDHkl2 
LpMDHkl3 
LpMDHkl4 
LpMDHklS 
LpMDHkie 
LpMDHkl? 
LpMDHklB 
LpMDHklS 
LpMDHk20 
LpMDHk21 
LpMDHk22 
LpMDHk23 
LpMDHk24 
lLpMDHk25 
LpMDHk26 
LpMDHk27 
LpMDHk28 
LpMDHk29 
LpMDHkS 0 
LpMDHkS 1 
LpMDHk32 
LlM3Hk33 
LpMDHk34 
LpMDHk35 
LpMDHk36 
LpMDHkS 7 
LpMDHk38 
LpMDHkS 9 
LpMDHk4 0 
LpMDHk41 
LpMDHk42 
' LpMDHk43 
LpMDHk4 4 
LpMDHk45 
LpMDHk46 
jLpMDHk4 7 
LpMDHk48 
LpMDHk49 
LpMDHkS 0 
LpMDHkSl 
LpMDHk52 
LpMDHkSS 
LpMDHk54 
LpMDHkS 5 
LpMDHkS 6 
LpMDHkS 7 
LpMDHkSS 
LpMDHkS 9 
LpMDHkS 0 
LpMDHkSl 
LpMDHk62 
LpMDHkSS : 
LpMDHkS4 
LpMDHkSS 
LpMDHkSS 



800 



820 



840 



CTCTTCGGGGTTTM CACCCTGGATGTTGTrAr.AnrTaz^nzirn^^nPr.^An_^^r.^jj/n 



CTCTTCGGGGTTTCCACCCTGGATGTTGTCAGAGCTAACACATTTGTAGCTC 



CTCTTCGGGGTTTCCACCi 

CTCTT^GGGGgTTNCACCCTG 

ctcttcggggtttccaccctggatgttgtcaja. 
ctcttcggggtttccaccctggatgttgtcaI 

CTCTTCGGGGTTTCCACCCTGGATGTTGTCAGA GC TAACACATTTGTAGCTCAJNAAGAAG 



ctcttcggggtttccaccctggatgttgtcagaJ 

CTCTTCGGGGTTTCCACCCTGGATGTTGTCAGAr 



CTgTTCJ^GGGF^TTgCACCCTGGATGTTGg]C^ 

CTCTTCGGGGMTTCCACCCTGGATGTTGTCAGAGCTAArAPa-r-r-rrz-rnrrn^ 



CTCTTCGGGGTTTWCACCCTGGATGTTGTCAGAGCTAAC; 



CTCTTCGGGGTTTC 



CTCTTCGGGGTTTCCACCCTGGATGTTGTCAGC^GCTAArArATT 



CTCTTCGGGGTTTCC.C5 CCCTGGATGTTGTCAGG'GCTAACArATTTP,TAnr^^CA^- 



801 



801 



758 
7S1 
772 
772 
800 



771 
773 



764 
785 



786 



JTCTTCGGGGTTTCCACCCTGGATGTTGTCAGAGCTAACACATTTGTAGCTCAGAAGAAG 
CTCTTCGGGGTTTCCACCCTGGATGTTGTCAGAGCTAACACATTTGTAGCTCAGAAGAAG 



741 

770 
777 



762 
701 



467 
354 




860 



IjpMDHkl 
LpMDHk2 
LpMDHk3 
LpMDHk4 
LpMDHkS 
LpMDHke ' 
LpMDHk? 
LpMDHkS 
LpMDHkS 
LpMDHklO 
LpMDHkll 
LpMDHkl2 
LpMDHkl3 
IipMDHkl4 
LpMDHkis 
LpMDHkl6 
LpMDHkl? 
• LpMDHkis 
LpMDHkis 
LpMDHk20 
LpMDHk21 
LpMDHk22 
LpMDHk23 
LpMDHk24 
)LpMDHk25 
LpMDHk*26 
LpMDHk27 
LpMDHk28 
LpMDHk29 
LpMDHk30 
LpMDHkS 1 
LpMDHkS 2 
LpMDHkS 3 
LpMDHk34 
LpMDHkS 5 
LpMDHkS 6 
LpMDHk37 
LpMDHk3B 
LpMDH3c39 
LpMDHk40 
LpMDHk41 
LpMDHk42 
LpMDHk43 
LpMDHk44 
LpMDHk45 
LpMDHk46 
.|LpMDHk47 
LpMDHk48 
LpMDHk49 
LpMDHkS 0 
LpMDHkSl 
LpMDHk52 
LpMDHH53 
LpMDHkS 4 
LpMDHkS 5 
LpMDHkS 6 
LpMDHkS 7 
LpMDHkS 8 
LpMDHkS 9 
LpMDHkS 0 
LpMDHkSl 
LpMDHkS 2 
LpMDHkSS 
LpMDHk64 
LpMDHk65 
LpMDHk66 



880 



900 



802 



AACCTCA 



807 



AACCTCAG3;JCTp3ATCG 



802 



767 



AACCTCAGCCTCATCGATGTTGATGTCCCAGTTGTCGGTGGCCATGCTGGGATCACGATT 
AACCTCAGCCTCATCGATGTTGATGTCCCAGTTGTCGGTGGCCATGCTGGGAT(PACGATT 



920 



940 



9Hkl 
LpMDHk2 
LpMDHk3 
LpMDHk4 
LpMDHkS 
LpMDHkS 
LpMDHk? 
LpMDHkB 
LpMDHkS 
LpMDHklO 
LpMDHkll 
LpMDHkl2 
LpMDHkl3 
L£^Hkl4 
LpMDHklS 
LpMDHkie 
LpMDHkl? 
LpMDHkia 
LpMDHkl9 
LpMDHk20 
LpMDHk21 
LpMDHk22 
IipMDHk23 
LpMDHk24 
)LpMDHk25 
LpMDHk26 
LpMDHk27 
IipMDHk28 
LpMDHk29 
L.pMDHk30 
LpMDHkB 1 
L£dyiDHk32 
LpMDHkB 3 
LpMDHk34 
LpMDHkB 5 
LpMDHkS 6 
LpMDHkB 7 
LpMDHkB 8 
LpMDHkB 9 
LpMDHk40 
LpMDHk41 
LpMDHk42 
LpMDHk4B 
LpMDHk44 
LpMDHk45 
LpMDHk46 



960 



.LpMDHk47 
LpMDHk48 
LpMDHk49 
LpMDHkS 0 
LpMDHkSl 
LpMDHk52 
LpMDHk53 
LpMDHk54 
LpMDHkS 5 
LpMDHkS 6 
LpMDHkS 7 
LpMDHkS 8 
LpMDHkS 9 
LpMDHkS 0 
LpMDHk61 
LpMDHk62 
LpMDHkSB 
LpMDHk64 
LpMDHk65 
LpMDHk66 



587 
474 




980 



1000 



1020 



LpMDHkl 
XipMDHk2 
LpMDHk3 
LpMDKk4 
LpMDHkS 
LpMDHk6 
ZipMDHk? 
LpMDHkS 
IjpMDHk9 
LpMDHkl 0 
LpMDHkll 
LpMDHkl2 
LpMDHkl3 
LpMDHkl4 
LpMDHkl 5 
LpMDHkl 6 
LpMDHkl7 
LpMDHJclB 
LpMDHkl 9 
LpMDHk20 
LpMDHk21 
LpMDHk22 
LpMDHk23 
LpMDHk24 
lLpMDHk25 
LpMDHk26 
LpMDHk27 
LpMDHk28 
LpMDHk29 
LpMDH3c30 
LpMDHk31 
LpMDHkS 2 
LpMDHk33 
LpMDHk34 
LpMDHk35 
LpMDHk36 
LpMDHkS 7 
LpMDHk38 
LpMDHk39 
LpMDHk40 
LpMDHk41 
LpMDHk42 
LpMDHk43 
LpMDHk44 
LpMDHk45 
LpMDHk46 
.LpMDHk47 
LpMDHk48 
LpMDHk49 
LpMDHkS 0 
LpMDHkBl 
LpMDHkS 2 
LpMDHkS 3 
LpMDHk54 
LpMDHkS 5 
LpMDHkS 6 
LpMDHk57 
LpMDHkS 8 
Lf^Hk59 
LpMDHk60 
LpMDHk61 
LpMDHk62 
LpMDHk63 
LpMDHk64 
LpMDHkGS 
LpMDHk66 



CTGACAAAGAGGATACAGAACGCTGGGACAGAGG^GGTGGAGGCGA-Ai 
o^r..,o.p.^.^^.^,vr-ii^.:.z>rr,PTnnni>rAGAGGCGGTGGAGGCGAAGGCTGGirGCTGGC 



634 
534 




LpMDHkl 
LpMDHk2 
LpMDHka 
IipMDHk4 
LpMDHkS 
lipMDHke 
LpMDHk? 
LpMDHke 
LpMDHk9 
LpMDHkl 0 
LpMDHkl 1 
LpMDHkl 2 
LpMDHkl3 
LpMDHkl4 
LpMDHkl 5 
. LpMDHkl 6 
LpMDHkl? 
LpMDHkl 8 
LpMDHkl 9 
LpMDHk20 
LpMDHk21 
LpMDHk22 
LpMDHk23 
• LpMDHk24 
I L£^Hk25 
LpMDHk26 
LpMDHk27 
LpMDHk28 
LpMDHk29 
LpMDHk30 
LpMDHk31 
LpMDHk32 
LpMDHk33 
LpMDHk34 
LpMDHk35 
LpMDHk36 
LpMDHk37 
LpMDHk38 
LpMDHk39 
IipMDHk40 
LpMDHk41 
LpMDHk42 
IipMDHk43 
IipMDHk44 
L|^Hk45 
LpMDHk46 
LpMDHk47 
LpMDHk48 
LpMDHk49 
LpMD'HkSO 
LpMDHkSl 
LpMDHk52 
LpMDHk53 
LpMDHk54 
LpMDHkSS 
LpMDHkSe 
CipMDHkS? 
:^pMDHk58 
apMDHk59 
!ipMDHk60 
jpMDHkei 
ipMDHk62 
ipMDHk63 
<pMDHk64 
ipMDHkes 
■pMDHk66 



1040 



* 10.60 



1080 



rCTGCTACTCTGTCCATGGCTTATGCCGCTGCCAGATTTGTTGAGTCATCGCTb?^^^ 



: 594 




1100 



1120 



LpMDHkl 
IipMDHk2 
LpMDHkS 
LpMDHk4 
LpMDHkS . 
LpMDHke 
LpMDHk? 
LpMDHke 
IipMDHk9 
LpMDHklO 
LpMDHkll 
Ii£^Hkl2 
LpMDHkl3 
LpMDHkl 4 
lipMDHklS 
LpMDHkl 6 
LpMDHkl? 
LpMDHkl 8 
LpMDHkl 9 
LpMDHk20 
LpMDHk2l 
LpMDHk22 
LpMDHk23 
LpMDHk24 
}LpMDHk25 
LpMDHk26 
LpMDHk27 
^ LpMDHk28 
' LpMDHk29 
LpMDHk30 . 
LpMDHkBl 
LpMDHk32 
LpMDHk33 
LpMDHk34 
LpMDHk35' 
LpMDHk36 
LpMDHk37 
LpMDHk38 
LpMDHk39 
LpMDHk40 
LpMDHk41 
IipMDHk42 
LpMDHk43 
LpMDHk44 
LpMDHk45 
LpMDHk46 
iLpMDHk47 
LpMDHk48 
LpMDHk49 
LpMDHkS 0 
LpMDHkSl 
LpMDHk52 
LpMDHk53 
LpMDHkS 4 
LpMDHkS 5 
LpMDHkS 6 
LpMDHkS? 
LpMDHkS 8 
LpMDHkS 9 
LpMDHke 0 
LpMDHkS 1 
LpMDHk62 
LpMDHkS 3 
LpMDHk64 
LpMDHkSS 
LpMDHkS 6 



1140 



ATGGCTGGT GATCCAGATGTTTACGAGTGCACGTATGTTCAGTCTGAGTTT^Cj^nAnn 



654 



XjpMDHk2 
LpMDHk3 
IipMDHk4 
LpMDHkS 
XipMDHke 
LpMDHk? 
LpMDHke 
LpMDHk9 
IipMDHklO 
. LpMDHkll 
i:jpMDHkl2 
XipMDHkr3 
LpMDHkl4 
LpMDHklB 
LpMDHklg 
IipMDHkl?- 
LpMDHklB 

LpMDHk20 
LpMDHk21 
IipMDHk22 
IipMDHk23 
LpMDHk24 
) LpMDHk25 
LpMDHk26 
LpMDHk27 
LpMDHk28 
LpMDHk29 
LpMDHk30 
LpMDHkSl 
LpMDHk32 
LpMDHk33 
LpMDHk34 
IipMDHk35 
IipMDHk36 
. LpMDHk37 
LpMDHk38 
LpMDHk39 
IipMDRk40 
LpMDHk41 
LpMDHk42 
LpMDHk43 
LpMDHk44 
LpMDHk45 
LpMDHk46 
jLpMDHk47 
IjpMDHk48 
IipMDHk49 
.LpMDHkSO 
LpMDHkSl 
LpMDHk52 
LpMDHk53 
LpMDHk54 
LpMDHkSB 
LpMDHk56 
LpMDHkS? 
LpMDHkSB 
LpMDHkS 9 
LpMDHke 0 
LpMDHke 1 
LpMDHke 2 
LpMDHk63 
LpMDHk64 
LpMDHke 5 
LpMDHkee 



1160 



1180 * 1200 



CCAlTCTOCGCGTCCAGAGTT/^GCTTGGGAAGGACGGNGTTGAGTCCATCATt 



714 




LpMDHkl 
hpm}3k2 
LpMDHka 
LpMDHk4 
LpMDHkS 
LpMDHke 
LpMDHk? 
LpMDHkS 
LpMDHk9 
LpMDHk^O 
LpMDHkll 
LpMDHkl2 
IipMDHkl3 
I<pMDHkl4 
LpMDHkXS 
LpMDHkl6 
LpMDHkl? 
LpMDHklS 
IipMDHkl9 
LpMDHk20 
LpMDHk21 
IipMDHk22 
LpMDHk23 
LpMDHk24 
)LpMDHk25 
LpMDHk26 
I<pMDHk27 
IipMDHk26 
LpMDHk29 
LpMDHkaO 
LpMDHkS 1 
LpMDHk32 
LpMDHk33 
LpMDHk34 
LpMDHkS 5 
LpMDHk36 
LpMDHk37 
LpMDHk38 
LpMDHkS 9 
LpMDHk40 
LpMDHk41 
LpMDHk42 
LpMDHk43 
LpMDHk44 
LpMDHk45 
LpMDHk46 
jLpMDHk47 
LpMDHk48 
LpMDHk49 
LpMDHkS 0 
LpMDHkSl 
LpMDHk52 
LpMDHk53 
LpMDHk54 
LpMDHk55 
LpMDHk56 
LpMDHkS? 
LpMDHkSS 
LpMDHkS 9 
LpMDHkeO 
LpMDHk61 
0pMDHk62 
LpMDHk63 
CipMDHk64 
[/pMDHk65 
:<pMDHk66 



1220 



1240 



1260 



GACCTGGAGGGAGTGACGGAGTACGAGGCCAAGGCGCTTGANGCATTGAAGGCtG 



774 



^^^^ 

' XjpMDHkl 


: - - 


IjpMDHk2 


: 


LpMDHk3 


: - — 


LpMDHk4 


* 


LpMDHkS 


: ~ — 


LpMDHkS 


; - — 


LpMDHk? 




LpMDHkB 


• ^ — • 


LpMDHkS 


z - - . 


lipMDHklO 


: - - ■ 


LpMDHkll 


- 


LpMDHkl2 




. IipMDHkia 


: 


LpMDHkl4 


J 


LpMDHklS 


• 


LpMDHkl6 


: 


LpMDHkl? 


: --- 


LpMDHklS 


• 


LpMDHklS 


: 


LpMDHk20 


: 


LpMDHk2l 


: 


LpMDHk22 


: 


LpMDHk23 


: 


LpMDHk24 


: 


)LpMDHk25 


5 


LpMDHk2j5 


; 


LpMDHk27 





LpMDHk28 


: 


LpMDHk2S 


: 


LpMDHkS 0 


: 


LpMDHk31 


: 


IjpMDH]c32 


; 


Lj>MDHk33 


■ 


LpMDak34 


: 


LpMDHk35 


5 


LpMDHk36 


. 


LpMDHkS 7 




Lf^Hk38 





LpMDHk3S : 





LpMDHk40 : 





LpMDHk41 : 





LpMDHk42 : 





LpMDHk43 : 





LpMDHk44 : 





LpMDHk45 : 





LpMDHk46 : 





jLpMDHk47 : 





LpMDHk48 : 


- 


LpMDHk4S : 





LpMDHkS 0 : 





LpMDHkS 1 : 




LpMDHk52 : 





LpMDHk53 : 




LpMDHk54 : 




LpMDHkSS : 




LpMDHk56 : 




LpMDHk57 : 


LpMDHkSS : 




LpMDHkSS : 




LpMDHkeO : 




LpMDHkS 1 : 




LpMDHk62 ; 




LpMDHk63 : 




LpMDHk64 : 




LpMDHkSS : 




LpMDHkSS ; 






re 38 Nucleotide sequence of LpMRHl 



* 20 * 40 . * 60 
LpMDHl : GNAAACAGimGCGNCTTTTCCTNCANTGTTGCCGTGCAATaSCTGiU^^ : 60 

' * 80 * . 100 . ' * 120 

LpMDHl : TCATATACGAGGAAGTAATTATTGATAACTGCTGTATGACXSCTCGTC^ s 120 

* 140 * 160 . * 180 
LpMDHl . : CGTTTGATGTATTAGTGATGCCAAATCTATATGGCQAOVTTATTAGTGATCTATGTGCTC : 180 



* 200 * * 220 * 240 

LpMDHl : GTTTGATCGGAGGCTTGGGCCTAACTCCCAGCTGGAACATTGGTQAAGGTGGCyVTTTGTC 



240 



* 260 * 280 * 300 

LpMDHl :. TTGCMAGGCTGTCCATGGCTCTGCT^CCTGATATATCTGGCaVAGA^ 



300 



* 320 * 340 * 360 

LpMDHl : Cn^CTCTTATGCTGAGTQCTGTTATGATGTTGCGCCACTTGCAATl^^ 



360 



* 380 * 400 * 

LpMDHl : AACGGATCCACAATGCTATCCTCCAGACTATCGNCGAGGGGAAGNAGANAACTG •: 414 



^jP^e 39 Deduced amino acid sequence of LpMDHl • 

* ♦ 20 * 40 * 60 
lipMDHl : KQXXIiFXXCCRAIJOCKyPEIIYEEVIIDNCOlTLVKNPGTFDVL - 

* 80 * 100 * 120 
LpMDHl : LIGGLGLTPSCNIGEGGICIAEAVHGSAPDISGKOTA^ • 

* 

LpMDHl ; RIHNAILQTIXBGKXXT : 137 




e 40 Nucleotide secpience o£ LpMDHm 



* 20 . * 40 * 60 

LpMDHm : GNOVCOTCCAGNNACAACTCTGGTACCTCAATTGCT : 60 



XipMDHm 



* 80 * 100 * 120 

AATCCACTACACAGCraCGAGCTACCCCGCCCCCGCAATCCAAACTACCT 



120 



LpMDHm 



* 140 * 160 * 180 

AATCTAOU^CATGAAGGCAGTCGTAGCTGGAGCCGCCGGTGGCATTGGAaVGCCAT^ 



180 



LpMDHm 



* 200 * 220 * • 240 

CCTCCTCCTTAAGACCTGCCCGCTCGTCACTGAGCTCGCCCTATACGATGTCGTCAACGC 



240 



* 260 * 280 • * 300 

LpMDHm : CGTCGGTGTCGCGACTGACCTCTCCCaVCATCTCCTCGCCCGCGAAAGTAACCGGCTACCT 



300 



) 



* 320 * 340 * 360 

LpMDHm : GCCGGCAAATQACGGTATGCAGCAGGCTCTCACTGGCGCCGACATCGTC 



360 



LpMDHm : 



* 380 . * 400 * ■ 420 

TGGTATTCCCCGCAAGCCCGGCATGACCCGTGACGACCTCTTCAAGATCAACGCCGGCAT 



420 



LpMDHm 



* 440 * 460 * 480 

TGTCCAGGGTCTCATCGAGGGTGTCGCCAAGCACTGCCCCAAGGCATACGTTCTCGTC^^ 



480 



LpMDHm 



* 500 *- 520 *. 540 

CTCCAACCCCXSTa^CTCGACTGTGCCCa^TCGCaSCCGAGGTGCTGAAGAAGGCC^ 



540 



LpMDHm 



* 560 * 580 * 600 

CTTCGACCCCAAGAAGCTCTTCGGTGTCACCACCCTCGATGTC6TCCGCGCCGAGACCTT 



600 



* 620 * 640 * 

LpMDHm : CGTTGCCGAGATCACTGGCGAGAAGGACCCAGCGAAGTTGAACATNCCCGTA 



652 



^l^re 41 • Deduced amino acid sequence of LpMDHm 



* , 20 * 40 * 60 

IipMDHm : XXPXTTLVPQLLIiHTSLLLPIHyTASSYPAPAIQTTSP*QIYlsmKAVVAGAAGGIGQPLS : 59 



* 80 * 100 * 120 

LpMDHin : LLIiKTCPIjWEIiALYDVVNAVGVATDIiSHISSPAKOTGyLPAND^ : 119 



* 140 * 160 * 180 

LpMDHm : GIPRKPGMTIUDDLPKINAGIVQGLIEGVAKHCPKAyVLVISNPVNSTVPIAAEVIjK^ : 179 



* 200 * 

lipMDHm : FDPKKLFGVTTIiDVVRABTFVAEITGEKDPAKIiNXPV : 216 



^l^re 42 Nucleotide sequence of LpPEPCa 



* 20 * 40 * 60 
LpPEPCa : GNGTACACGAAATAGAATCAACGGAAAGCANGAAGTGATGATTGGGTATCyVGCA : 60 

* - 80 • . * 100 * 120 
LpPEPCa : GAAGGATGCTGGCCGTTTCTCroCTGGTTGGCACTTGTACAAAGCTO : 120 



* 140 * 160 * • 180 
LpPEPCa : TAAGGTTOCGGAGACGTTTGGGGTTAAGNTGACTATGTTTCATGGAC^ : 180 

* 200 * 220 * 240 
LpPEPCa : TGGAAGAGGTGGCGGCC'CTACCCATCTTGCTATACTGTCACAACCTCCAGATACTGTCCA : 240 



• * 260 * 280 * • 300 

LpPEPCa : TGGATCACTTCGGGTAACTGTTCAAGGTQAAGTCATTGAGCAGTCCTO : 300 



. * 320 * 340 * 360 

LpPEPCa : TTTGTGTTraAGAACGCTTCAACGTTTTACAGCTGCTACT : 360 



* 380 * 400 *• 420 

LpPEPCa : ACCAATCTCACCTAAACCAGAATGGCGTGCTTTGATGGATGAAATGGCTGTTGTTGCCAC : 420 



* 440 * 460 * 480 

LpPEPCa : AGAGGAATACCGTTCCATTGTTTTCCAAG^CCAAaATTTQTTGAGTAT^ : 480 



* 500 * 520 * 540 

LpPEPCa : AACACCAGAGCTCGAGTATGGTAGGATGAATATTGGAAGCAGGCCATCAAAACGT^ : 540 



* 560 * 580 * 600 

LpPEPCa : AAGCGGAGGAATCGAATCATTGCGTGCAATTCCTTGGATATTTGCTTGGACACAGACTAQ : 600 



* 620 * 640 * 660 

LpPEPCa : ATTCCACCTGCCAQTGTGGCTTGNTTTTGGTGCGGCCTTCAAGCATC : 660 



* 680 * 700 * 720 

LpPEPCa : CATTCGTANTCTTCAAATCCTTCAGCAGATGTACAACGAGTCGCCGl^ : 720 



* 740 * 760 

LpPEPCa : AAACCTGGTTGAGATGGTGTTTGCCAAGGGCGATCCAGGTATAGCAGCT : 76.9 



^j^^e 43 Deduced amino acid sequence of LpPEPca 



* -20 . * ■ 40 * 60 

LpPBPCa : XTRNRINGKXEVMIGYQHSGKDAGRFSAGWHLYKAQEELIKVAETF^^ : go 



* 80 * 100 * 120 

LpPEPCa : GRGGGPTHIiAILSQPPDTVHGSLRVWQGEVIEQSFGEEHIiCPRTLQRPTAATIi^ : 120 



* 140 * 160 * 180 

LpPBPCa : PISPKPBWRAIiMDEM?lVVATEEYRSIVFQEPRFVEYFRI^^ : 180 



* 20C * 220 * 240 

LpPEPCa : SGQIESLRAIPWIFAWTQTRFHLPWLXPGAAFKHVLQKDIRXLQILQQMTraEWPFRVTI : 240 



LpPBPCa : NLVEMVPAKGDPGIAA : 256 




e 44 Consensus contlg nucleotide sequence of LpPBPCb 



* 20 * 40 * 60 
LpPEPCb : GAAGAAGTTGCTGATGTTTTAZWSNACaVTTTNTO : go 

* • 80 * 100 * 120 
IjpPEPCb : ITOGTGCTTACATOlTCTGAATGGCyUVCTGCCCCATCTGATGTGCTTC : 120 

* 140 * 160 * 180 
LpPEPCb : ItKaWXXSGGAGTGCCATATAAAAAAGCCATTGAGAGTTO : 180 

* • . 200 * 220 ' * 240 
LpPEPCb : CAGATCTTGAANCAGCTCCAGCATCTGTTGCACGACTATTTTCAATAGACTGGTACATGA : 240 

* 260 * 280 * • 300 . 
LpPEPCb : ATAGAATCAATGGCAAGCAGGAGGTCATGATTGGATACTCAGACTCTGGGAAGGAC^ : 300 

^ * 320 * 340 * 360 

LpPBPCb : .GGCGTCTCTCTGCAGCGTGGCAAATGTATAAAGCACAAGAAGATCTCATAAAGGTGGCAA : 360 

* 380 * 400 * 420 
LpPEPCb : AGCAATATGGAGTAAAGTTAACAATGTTTCATGGAAGAGGTGGAACGGTTGGCAGAGGAG : 420 

* 440 * 460 * 480 
LpPEPCb : GTGGTCCCAGTCATCTTGCTATATTATCTCAACCACCAGACACaATACAAaGAT : 480 

* 500 * 520 * 540 
LpPEPCb : GTGTAACAGTTCAAGQCGAGGTCATAGAGCACTCATTTGGAGAGGAACACT^ : 540 

* 560 * 580 * 600 
LpPEPCb : GAACTCTGCAACGTTTCACTGCAGCTACTCTTGAGCATGGAATGCATCCTCCAATTTCAC : 600 

* 620 * 640 * 660 
LpPEPCb : CCAAGCCAGAATGGCGTGCTATAATGGATGAGATGGCTGTAGTGGCAACAAAAGAATATC : 660 

* 680 * 700 . * 720 
LpPEPCb : GATCAATTGTCTTCCAAQAACCACGTTTTGTCQAATACTTCCGCTCGGCAAC^ : 720 

* 740 * 760 * 780 
LpPBPCb : CTGAATATGGTCGGATGAATATTGGTAGCCGGCCATCAAAGAGAAAGCCTAGTGGAGGCA : 780 

* 800 * 820 * 840 
LpPEPCb : TAGAATCGCTCCGTGCAATTCCATGGATCTTTGCTTGGACACAGACAAGGTTO : 840 

* 860 * 880 * 900 
LpPEPCb : CTQTATGGCTTGGATTTGGTGCAGCGTTCAAACATATCATGCAGAAGQACAT^ : 900 



* 920 * 940 * 960 

LpPEPCb : TCCATACTCTGAAAGAAATGTACAATGAGTGGCCATTCTTTAGGGTCACCCTTGACTTGC : 960 




_ * 980 * 1000 * 1020 

LpPEPCb : TTGAGATGGTTTTTGCCAAGGGAGATCCAGGAATTGCTGCTTO : 1020 

* 1040 * 1060 * 1080 
LpPEPCb : TGTCTQAAGATCTGCAGCCCrTTGGGGAGCAGCTGAGAAACy^ : 1080 

* 1100 ' * 1120 • * 1140 
LpPEPCb : AGTTACTCCTTCAGGTTGCTGGCCACAAGGACGTTCTTGAAGGGGATCCTTACCTC^ : 1140 

* 1160 * 1180 * ' 1200 
LpPEPCb : AGCGTCTGCGGTTGCGTGAGTCATACATCACAACATTGAATGTTTGCCAAGCCNACACCC : 1200 

* 1220 * 1240 * 1260 
LpPEPCb : TCAAGCGGATAAGAGACCCTAGCTTCGAGGTGACACCGCAGCAGGCACCTCTGTCGA^ : 1260 

* 1280 * 1300 * 1320 
LpPEPCb : AGTTCGCTGATGAGAAGGAGCCAGCTGAGCTGGTGOUVCTQAACCGTGGGAGCGAGTACG : 1320 

i 

* 1340 * 1360 * 1380 
LpPEPCb : CCCCAGGCCTGGAGGACACGCTCATCCTTACCATGAAGGGTATTTGCTGTGGAATGCAAA : 1380 

* 1400 * 1420 • * 1440 
LpPEPCb : ACACAGGCTAGGCCAGTTTGCCTATTTGGAATAACTGTCATCCCCT^^ i 1440 

* 1460 * 1480 * 1500 
LpPEPCb : ATATGTGTGTTCCCCAAATGCTAGTGAACCCTGGAGGCATTTTCGCCACTO : 1500 

* 1520 * 1540 * 1560 
LpPEPCb : TTGGTTATGGATGNACTTTGATCTTAATGNCAAGGGTTGTTGAAGCCTGATCTAAATAAA : 1560 



. * 1580 * 1600 * 1620 

LpPEPCb : ATATGGAAC^ATGATATTCTCGTNGGATCTAATAATTTGCTTGGC^ 



1620 



J 

* 1640 
LpPEPCb : GNOATTTGGAQTNGTTTAAC : 1640 



re 45 Deduced amino acid sequence o£ LpPEPCb 

* 20 * 40 * 60" 
LpPEPCb : RSCXCFIQCIXVLABLPADCPGAYIISMATAPSDVIjAVELLQRECHIKKP^ : 60 

* 80 * 100 * 120 
LpPEPCb : DLEXAPASVJUUJFSlDWYMNRINGKQEVMlGYSDSGKDAGiajSAAWQMYK^ : 120 

• * 140 * 160 * 180 

LpPEPCb : QYGVKLTMPHQRGGTVGRGGGPSHLAILSQPPDTIQGSLRVTVQGEVIBHSFGEEHLCPR : 180 

* 200 * 220 * 240 
LpPEPCb : TLQRFTAATLEHGMHPPISPia»EWRAIMDEMAWATKEYRSIVFQEPRFVEYP^ : 240 

* 260. * 280 * 300 
LpPEPCb : EYQRMNIGSRPSKRKPSGGIESLRAIPWIFAWTQTRFHLPVWLQFGAAFKHIMQKDIRNI : 300 

) 

* 320 * 340 •* 360 
LpPEPCb : HTLKEMYNEWPPFRWLDLLBM\^AKGDPGIAALYDKLLVSEDLQPP6EQLRiraFEETKQ : 360 

* 380 ' * • 400 * 420 
LpPEPCb : LLLQVAGHKDVLEGDPYLKQRLRLRESYITTLNVCQAXTLKRIRDPSPEVTPQQAPLSKE : 420 

* 440 * 460 
LpPEPCb : FADEKEPAELVQLNRGSEYAPGLEDTLILTMKGICCGMQNTG : 462 



WPRe 46 Nucleotide sequences of nucleic acid fragments contributing to the 
consensus contig sequence LpPEPCb 



20 



40 



60 



LpPEPCbl 
LpPEPCb2 
LpPSFCbB 
LpPEPCb4 
LpPEPCbS 
LpPEPCb6 



GAAGAAGTTGCTGATGTTTTAAGNACATTTNTGTCCTTGCAGAGCTCGCAGCAGATTGTT 



60 



80 



100 



120 



LpPEPCbl 
LpPEPCb2 
LpPEPCb3 
LpPEPCb4 
LpPEPCbS 
LpPEPCbS 



TTGGTGCTTACATCATCTCAATGGCAACTGCCCCATCTGATGTGCTTC^CTGTTGAGCTTT 



120 



140 



160 



180 



LpPEPCbl 
LpPEPCb2 
LpPEPCb3 
LpPEPCb4 
LpPEPCbS 
LpPEPCb6 



TGCAGCGGGAGTGCCATATAAAAAAGCCATTGAGAGTTGTTCCACTAirTTGAAAAGCTTG 



180 



200 



220 



240 



LpPEPCbl 
LpPEPCb2 
LpPEPCb3 
LpPEPCb4 
LpPEPCbS 
LpPEPCb6 



CAGATCTTGAANCAGCTCCAGCATCTGTTGCACGACTATTTTCAATAGACTGGTACATGA 



240 



260 



280 



300 



LpPEPCbl 
LpPEPCb2 
LpPEPCbS 
LpPEPCb4 
LpPEPCbS 
LpPEPCb6 



ATAGAATCAATGGCAAGCAGGAGGTCATGATTGGATACTCAGACTCUGGGAAGGACGCTG 



300 



320 



340 



360 



LpPEPCbl 
LpPEPCb2 
LpPEPCb3 
LpPEPCb4 
LpPEPCbS 
LpPEPCb6 



ggcg tctctctgcagcgtggcaaatgtataaagcacaagaagatctgataaaggtggcaa! 

GTATA7\AGCACAAGAAGATCTGATAAAGGTGGCAA 



360 

35 



* 380 * 400 * 420 

LpPEPCbl 
LpPBPCb2 
LpPEPCb 3 
LpPEPCb4 
LpPEPCbS 
LpPEPCb6 



AGCAATATGGAGTAAAGTTAACAATGTTTCATGGAAGAGGTGGAACGGTTGGCAGAGGAG 
A.GCAATATGGAGTAAAGTTAAC AATGTTTCATGGAAGAGGTGGAACGGTTGGCAGAGGAG 
BHBgBJBJ|^B^^BHBAA.TGTTTBgTGGAAGAGGTGGAAC(gGTTG GCAGAGGAG 
^^^WWi^^^^fc— WWW— ^W^^^^^IW^— Wr; C Af AG GAG 




LpPEPCbl 
LpPEPCb2 
LpPEPCb3 
LpPEPCb4 
LpPEPCbS 
LpPEPCb6 



440 



460 



GTGGTCCCAGTCATCTTGCTATATTATCTCAACCACCAGACACGATAGAAGGATCACTTC 
GTGGTCCCAGTCATCTTGCTATATTATCTCAACCACCAGACACGATAGAAGGATCACTTC 
GTGGTCCCAGTCATCTTGCTATATTATCTCAACCACgAGACACGATAGAAGGATCACTTC 
GTGGTCGCAGTCATCTT GCTATATTA TCTC AACCACCAGACACGATAGAAGGATCACTTC 



480 
155 
97 
69 



LpPEPCbl 
LpPEPCb2 
LpPEPCbS 
LpPEPCb4 
LpPEPCbS 
LpPEPCbe 



500 



520 



GTGTAACAGTTCAAGGCGAGGTCATAGAGCACTCATTTGGAGAGGAACACTTGTGCTTCA 

gtgtaacagttcaaggcgaggtcatagagcactcatttggag|ggaacacttgtgcttca 

GTGTAACAGTTCAAGGCGAGGTCATAGAGCACTCATTTGGAGAGGAACACTTGTGCTTCA 
GTGTAACAGTTCAAGGCGAGGTCATAGAGCACTCATTTGGAGAGGAA(j:ACTTGTGCTTCA 



LpPEPCbl 
LpPEPCb2 
LpPEPCbS 
J LpPEPCb4 
LpPEPCb5 
LpPEPCbe 



560 



580 



600 



gAACTCTGCAACGTTTCACTGCAGCTACTCTTGAGCATGGAATGCATCCTCCAATTTCGC 
GAACTCTGCAACGTTTCACTGCAGCTACTCTTGAGCATGGAATGCATCCTCCAATTTCAC 
GAACTCTGCAACGTTTCACTGCAGCTACTCTTGAGCATGGAATGCATCCTCCAATTTCAC 
GAACTCTGCAACGTTTCAC TGCAGCTACTCTTGAGCATGGAATGCATCCTrPZ\2XTTTrar 



600 
275 
217 
189 



LpPEPCbl 
LpPEPCb2 
LpPEPCba 
LpPEPCb4 
LpPEPCbS 
LpPEPCb6 



620 



640 



C C AAgC C AG AATG GCijTGCTATAATG GATGAi|ATGGCTGTAGNGGCAgCAAAAGAAg!ATC 
CCAAGCCAGAATGGCGTGCTATAATGGATGAGATGGCTGTAGTGGCAkCAAAAGAATATC 
CCAAGCCAGAATGGCGTGCTATAATGGATGAGATGGCTGTAGTGGCAkcAAAAGAATATC 

ccaagccagaatggcgtgctataatggatgagatggctgtagtggca!acaaaagaatatc! 



660 
335 
277 
249 



LpPEPCbl 
LpPEPCb2 
LpPEPCb3 
LpPEPCb4 
LpPEPCbS 
LpPEPCb6 



680 



700 



720 



GATCAATTGN.CTTCCAAGAACCigCgTTTTG|r"cgAATA 

gatcaattgtcttccaagaaccacgttttgtcgaatacttccgctcggcaacacctgaga 

GATCAATTGTCTTCCAAGAACCACGTTTTGTCGAATACTTCCGCTCGGCAACACCTGAGA 
GATCAATTGTCTTCCAAGAAC GACGTTTTGTCGAATACTTCCGCTCGnraArarrTnZinz^ 



697 
395 
337 
309 



LpPEPCbl 
LpPEPCb2 
LpPEPCb3 
LpPEPCb4 
LpPEPCbS 
LpPEPCb6 



740 



760 



780 



CTGAATATGGTCGGATGAATATTGGTAGCCGGCCATCAAAGAGAAAGCCTAGTGGAGGCA 
CTGAATATGGTCGGATGAATATTGGTAGCCGGCCATCAAAGAGAAAGCCTAGTGGAGGCA 
CTGAATATGGTCGGATGAATATTGGTAGCCGGCCATCAAAGAGAAAGCCTAGTGGAGGCA 



455 
397 
369 



LpPEPCbl 
LpPEPCb2 
LpPEPCb3 
LpPEPCb4 
LpPEPCbS 
LpPEPCb6 



800 



820 



840 



tagaatcgctccgtgcaattccatggatctttgcttggacacagacAaggtttcatcttc 

TAGAATCGCTCCGTGCAATTCCATGGATCTTTGCTTGGACACAGAC^AGGTTTCATCTTC 
rAGAATCGCTCCGTGCAATTCCATG GATCTTTGCTTGGACACAGACAAGGTTTCATCTTC 



515 
457 
429 




860 



880 



900 



LpPEPCbl : 
LpPEPCb2 : 
LpPEPCbS 
LpPEPCb4 
LpPEPCbS 
LpPEPCb6 



CTGTATGGCTTGGATTTGGTGCAGCGTTCAAACATATCATGCAGAAGGACATCAGGAATA 
CTGTATGGCTTGGATTTGGTGCAGCGTTCAAACATATCATGCAGAAGGACATCAGGAATA 
CTGTATGGCTTGGATTTGGTGCAGCGTTCAAACATATCATGCAGAAGGACATCAGGAATA 



LpPEPCbl 
LpPEPCb2 
lipPEPCOaS 
LpPEPCb4 
LpPEPCbS 
LpPEPCb6 



920 



940 



960 



TCCATACTCTGAAAGAAATGTACAATGAGTGGCCATTCTTTAGGGTCACCCTTGACTTGC 
TCCATACTCTGAAAGAAATGTACAATGAGTGGCCATTCTTTAGGGTCACCCTTGACTTGC 
TCCATACTCTGAAAG7\AATGTACAATGAGTGGCCATTCTTTAGGGTC|^CCCTTGACTTGC 



LpPEPCbl 
LpPEPCb2 
LpPEPCbS 
LpPEPCb4 
LpPEPCbS 
LpPEPCbS 



980 



1000 



1020 



TTGAGATGGTTTTTGCCAAGGGAGATCCAGGAATTGCTGCTTTATATGACAAATTGCTTG 
TTGAGATGGTTTTTGCCAAGGGAGATCCAGG^ATTGCTGCTTTATATGACAAATTGCTTG 
TTGAGAT GGTTTTTGCCGAGGGAGATCCAGGAATTGCTGCTTTATATGACAAATTGCTTG 
GGTTTTTGBCilAGGGAGATCCBGGHATTGCTGCTTTATATpACAAATTGCTTG 



LpPEPCbl 
LpPEPCb2 
LpPEPCbS 
LpPEPCb4 
LpPEPCbS 
LpPEPCbS 



1040 



1060 



1080 



TGTCTGAAGATCTGCAGCCCTTTGGGGAGCAGCTGAKAAACAACTTTGAAGAGACGAAAC 

tgtctgaagatctgcagccctttggggagcagctgagaaacaactttpaagagacgaaac 
tgtctgaagatctgcagccctttggggagcagctgagaaacaactttpaagagacgaaac 
tgtctgaagatctgcagccctttggggagcIgctgagaaacaacttt.gaagagacgaaac 



LpPEPCbl 
LpPEPCb2 
LpPEPCb3 
LpPEPCb4 
LpPEPCbS 
LpPEPCbS 



1100 



1120 



1140 



AGiJTACTCgTTAiAGGTTGgTGgCCACAAGGl 

agttactccttcaggttgctggccacaaggacgttcttgaaggggatccttacctgaagc 
agttactccttcaggttgctggccacaaggacgttcttga@gggga'i]ccttacctgaagc 
agttactcctt caggttgctggccacaaggacgttcttgaaggggatccttacctgaagc 

ggacgttcttgaaggggatccttacctgaagc 



1160 



LpPEPCbl 
LpPEPCb2 
LpPEPCb3 
LpP£PCb4 
tpPEPCbS 
LpPEPCbS 



1180 



1200 



AGCGTCTGCGGGTGCGTGAGTCATAC 
AGCGTCTGCGGTTGCGTGAGTCATACATCACAACAj 
AGCGTCTGCGGTTGCGTGAGTCATACATCACAACATTGTVATGTTTGGCAAGCCgACACCC 
AGCGTCTGCGGTTGCGTGAGTCATACATCACAACATTGAATGTTTGGCAAGCGN'SCACCC 



783 
764 
230 
92 



LpPEPCbl 
LpP£PCb2 
LpPEPCb3 
LpPEPCb4 
LpPEPCbS 
LpPEPCbS 



1220 



1240 



1260 



TGAAGCGGATAAGAGACCCTAGCTTCGAGGTGACACCGCAGCAGGC^j^CCTCTGTCGAAGG 
TGAAGCGGATAAGAGACCCTAGCTTCGAGGTGACACCGCAGCAGGC^CCTCTGTCGAAGG 



290 
1S2 




1280 



LpPEPCbl 
IipPEPCb2 
LpPEPCbS 
LpPEPCb4 
LpPEPCbS 
IipPBPCb6 



1300 



1320 



AGTTCGCTGATGAGAAGGAGCCAGCTGAGCTGGTGCAACTGAACCGTGGGAGCGAGTACG 
AGTTCGCTGATGAGAAGGAGCCAGCTGAGCTGGTGCAACTGAACCGTGGGAGCGAGTACn 



350 
212 



LpPEPCbl 
LpPEPCb2 
LpPEPCb3 
LpPEPCb4 
LpPEPCbS 
LpPEPCb6 



1340 



1360 



1380 



CCCCAGGCCTGGAGGACACCCTCATCCTTACCATGAAGGGTABxTGCrrGTGGAATGCAAA 

ccccaggcctggaggacaccctcatccttaccatgaagggtatttgcItgtggaatgcaaa 



409 
272 



LpPEPCbl 
IipPEPCb2 
LpPEPCbB 
. IjpPEPCb4 
IipPEPCbS 
LpPEPCbe 



1400 



1420 



1440 



ACACAGGCTAGGCCAGTTTGCCTABTTGGAATAACTGTCATgCCGTCAGATGG^ 

acacaggctaggccagtttgcctatttggaataactgtcatcccgtcagatBgggcgtga 



LpPEPCbl 
XipPEPCb2 
LpPBPCbS 
IipPEPCb4 
IipPEPCbS 
IipPEPCbe 



1460 



1480 



1500 



atatgtgtgttccccaaatgctagtgaaccctggaggcattttggcgacttacatgcctt 



ATATGTGTGTTCCCCAAATGC' 



3AACCCTGGAGGCAHtTTGGCGACTTACATGCCTT 



528 
390 



IjpPEPCbl 
IipPEPCb2 
IipPEPCb3 
£ipPEPCb4 
LpPEPCbS 
LpPEPCb6 



1520 



1540 



1560 



TTGGTTATG.SATGmcBTTGATCTTAATGlNlCAAGGGTTGTTGAAGCGTGATCTAAATAAA 
TTGGTTATGGATGNACTTTGATCTTAATG^^CAAjGGTTGTTGAAGCQTGATCTAAATKTAa 



587 
450 



LpPEPCbl 
LpPEPCb2 
LpPEPCbS 
LpPEPCb4 
LpPEPCbS 
LpPEPCbe : 



1580 



1600 



1620 



ATATGGAACAATGATATTCTGGglvrGGATCTAATAATTTGCTTGGCTGTGGCATCGlNlAATA 
ATATGGAACAATGATATTCTGGT/gGgfjTCTlSA' 



646 
482 



LpPEPCbl 
LpPEPCb2 
LpPEPCbS 
LpPEPCb4 
LpPEPCbS 
LpPEPCbe 



1640 



GNGATTTGGAGTNGTTTAAC 



666 



^lUllie 47 Nucleotide sequence of LpPEPCc 

* 20 * 40- * 6.0 

LpPEPCc : AGO^CTGTNCTTNCCANCAACCa^CGTTTTGTNaSJ^ : 60 



* 80 * 100 * ' 120 

LpPEPCc : GCACACTGAATATGGTCGGCATGAATATTGGTiWSCCGGCCATaU^GAGAAAGCCTAGTG : 120 



* 140 * 160 * 180 

LpPEPCc : GAGGCATAGAATCGCTCCQTGCAATTCCATGCATCTTTGNTTGGACACAGACAAGGOT : 180 



* 200 * 220 

LpPEPCc : ATNTTCCTGTATGNCTTGNATTCGNCTCCACCNCCACCCCCNTA : 224 



m 

e 48 Deduced amino acid sequence of LpPEPCc 

* 20 * 40 . * 60 

LpPEPCc : XSXIiXXNHVIiXEYXPLGimrTIilOT^ • ^0 



LpPEPCc : XPVXLXFXSTXTP ; 73 



^^jllie 49 Nucleotide secpience of LpPBPCd 



* 20 * . 40 •* 60 
IipPEPCd : GTTNCTGGAACNiUlGOATCITCTTGAAGGT^ : 60 

* 80 * 100 * 120 
LpPEPCd : CGTGACGCGTACATCACCACCaVTGAACQTATGCCaVGGCCTACAa^TTG^ 2 12 0 

* 140 * 160 * 180 
LpPEPCd : GACGCAGACTACCACQTCGCACa?GCGGCCCCATCTTTCCAAGGAGGTTATG^ : 180 

* * 200 * 220 * 2*40 
LpPEPCd t AAGCCGGCTTCCGAGCTTGTGACGCTGAACCCGGCCAGCGAGTACGCCCCGGGGCTGGAG : 240 

* 260 * 280 * 300 
LpPEPCd : GACACCCTCATCTTGACCATGAAGGGCGTTGCTGCCGGTCTGCAAAACACCGGTTAGGGC : 300 

) 

* 320 * 340 * 360 
LpPEPCd : CAGGAGAGATGCCTGATCACCATCTTTTTGTATCTTCATGATGATGCGATGTT^ : 360 

* 380 * 400 * 420 
LpPEPCd : AGTCGTTTGCGGTGGGCCTTATATCTCTCGGACGTAGCTGCATCTGTCTCCCTGCTCAGT : 420 

* 440 * 460 * 480 
LpPEPCd : GAGGAATAATGGCGTTTCGCCCAAGTATATTGATAAATAAAGGGAACCGATGTXW^TTTC : 480 



* 500 • * 520 

LpPEPCd : AGATTTGTTTGTTAGTAATTGTTCTATTTATTTTGCGAAAAAAAAAA : 527 



e 50 Deduced amino acid sequence of LpPBPCd 

* 20 * 40 * 60 
LpPBPCd : VXGXKDLIiBGDPYLKQRIiRLraAYITTMNVCQAYTLKRIRDPD : 60 

* 80 * 

LpPEPCd : KPASELVTLNPASEYT^GLEDTIiIIiTMKGVAAGIiQimS : 98 



) 



ft 



e 51 Nucleotide sequence of LpPBPCe 



* 20 ♦40 * 60 

IipPEPCe : GTTACA( 



CGCGCMTTTGCrrGTTAGCAAGGNAGATaGCTGCTAACT^roT^ 60 



* 80 * 100 * • 120 

LpPBPCe : ATAQAAGATCTQATGTTTQAGCTCTCTATGTCWCGCTGCAGTGATGAACTTJ«3<K5TC^^ : 120 



* 140 * 160 * 190 

LpPBPCe : GCAGATGAAGTACAT* 



J, "jc w — — — 

CTGTCCTCAAAAAAAAAATCTQCAAAGCATTACATAGAGTTCTGG : 180 



* 200 * 220 * 240 

LpPBPCe : AAG' 



CAAGTTCCTCCAAATGAACCTTATCGTGTCATACTTaQCQATGTCAGGGATAAJUrrQ s 240 



* 260 * 280 * 300 

LpPBPCe : TACTATAi 



.CGCGCGAACGTTCTCGCCACATATTGACAACTGGAArrTCAGACATTCCAGAA : 300 



* 320 * 340 * 360 
LpPBPCe : GNGTCAACTTTTACTAATGTTGAACTGTTTCTGGAACCTCTTCAGCTGT^ : 360 

* 380 * 400 * 

LpPEPCe : TTATCTTNCTGTGGNGACaAANCTATTGCTGMIGGAASCCTTCTO •• 420 

* 440 * 
LpPBPCe : GNATCNACTTTGTGGGCTTACTCTNGCGAA : 450 



IIIBI^< 



e 52 Deduced amino acid secjuence of LpPBPCe 

* 20 ■ * 40 * 60 
LpPEPCe : WIU^VCLLARXMAANLYFSQIEDLMPELSMWRCSD 

* 80 * 100 * 
LpPEPCe : KQVPPNEPYRVIMDVWDKLYyTRERSIUIILTTGISDIPEXSTP^^ 

* ' 140 * 
IipPEPCe : liSXCXDKXIAXGSIiLDFXXXXXTLWAYSXE : 150 



e-53 Nucleotide sequence of LpPKPCf 

♦ 20 *. 40 * 60 

LpPEPCf : GGGGTGGTGGCCCTNCTCACCTTGCCTaJCCTGTCTCMICCACCAGNC»lC3\ATaUV^^^ s 60 



100 * 120 



* 80 * J-"" 
LpPBPCf : CMVCTCCGGGTGACTGTTCAAGGTaAAGTTATTGAGCAGAGCTT^^ : 120 . 

140 * 160 * • 180 

LpPEPCf : GCTTCAGOACGCTGCaaCGTTTCACAGCTGCTACTCTTaAQaV^ ' 180 

* 200 * 220 * 240 
LpPEPCf : TraCACa^GCCAGAGTGGCGAGCTCTTCTTGATGAGATGGCTGTGGTTGO^CTCAGG : 240 

* 260 * ■ 280 • * • 300 
LpPEPCf : AATACC6OTCAATCGTCTTCCAAGAACCACGCTTCQTCGAGTATTTCCGCCTTGCAACAC : 300 

! * 320 * 340 * 360 

LpPEPCf : CAGAGACAGAGTATGGCA6GATGAATATAGGAAGCAGGCCATCAAAGAGAAAACCAAGTG : 360 

* 380 * 400 * 420 
LpPEPCf : GTGGCATTGAATCACTCCGTGCAATTCCATGGATCTTCGCATGGACGCAQACACGGTTCC s 420 

* 440 * 460 * 480 
LpPEPCf : ACCTTCCAGTCTGQTTGGGCTTTGGTGQTGCATTCAAGCATATCCTCAAGAAGGACA : 480 

* 500 * 520 * 540 
LpPEPCf : GAAATTTCCATATaCTCCAGOAQATGTACAACQAGTGGCCATTTTTCAGGGTCACQATCG : 540 

* 560 * 580 * 600 
LpPEPCf : ATCTTGTTGAGATGGTGTTCGCCAAGGGTAACCCTGGCATTGCTGCCTTGTATGACAGGC : 600 

' « 620 * ■ 640 

LpPEPCf t TCCTGGTTTCAGaGGAGCTACAGCCACTGGQTGACAAGCTGAOG : 644 




te 54 Deduced amino acid sequence of LpPEPCf 



20 * . 40 * ^0 

LpPEPCf ; GGGPXHIjAXLSXPPXTI 



CINGSIiRVTVQGBVIEQSFGEEHLCFRTLQRPTiUmiBHGMRPPI 



* 80 * 100 * 120 
LpPEPCf : SPKPEWRALIiDEMAWATEEYRSIVFQEPRFVEYFRIATPETByGRMNIGSRPSK^ 

* 140 * 160 * 180 
LpPEPCf : GIESIJJAIPWIFAOTTQTRFHLPVWIiGFGGAFKHILKKDIRNFH^ 

* 200 ' * 
LpPEPCf : LVEMVFAKGNPGIAALYDRLIiVSEELQPIiGDKLR : 214 




Figure 55 Consensus contig nucleotide sequence of TrMDHa 

* 20 * • 40 * 60 
TrMDHa : GGCTTCTTAAAAACNCACTW^CTCTTTTCTATTGTTCTTATTTCTTCGATCTATTTC^ : . 60 

* 80 * 100 * "° 
TrMDHa s ATGGC(»AAGACCCAGTTaJTGTTCTTGTC3^CTGGTG<nXKa^GGAC^ : 120 

* 140 * 160 * 180 
TrMDHa : CTTGTCCCTATGATTGCTAGGGGAGTGATGCTCS3GCCCTaACCAGCCTGTGATCCTCCAC : 180 

* • 200 * 220 * 240 
TrMDHa : ATQCTTOACavrTCCACCTGCAGCCGAATCACTCAACGGTGTTAAAATGGAGTTGGTGGAT : 240 

« 260 * • 280 * 300 

, TrMDHa : OCTGCATTCCCTCTTCTXAAAGGAGTTGTTOCTACAACTGATGTGGTTGAGGCATGCaVCT : 300 

* 320 * 340 * 360 
TrMDHa : GGTGTCAATATrGCCGTTATGGTTOGTGGGTTCCCTAQAAAAGAAGGTATGGAOAGGAAA s 360 

* 380 * 400 * 420 
TrMDHa : QATGTGATGACAAAAAATGTCTCTATTTACAAGTCTCAGGCTTCTGCCCTTGAAAAACAT : 420 

* 440 * 460 * 480 
TTMDHa : QCTQCTGOUU^CTGCAAGGTTCTTGTTGTTGCCAACCCaMKaU^CAC^ : 480 

* 500 ■ * 520 * 540 
TrMDHa : TTQAAGGAATATGCTCOVTCCaTTCCTGAGAAAAACATTTCTGCraTGACTAGATTGQAC t 540 

* • 560 * 580 * 

.TrMDHa : CATAACAGGaCACTAGGTCAAATTTCTGAAAGACTAAACGTTOAAQTTTCTQATGTGAAA : 600 

* 620 * . 640 * • 660 
TrMDHa : AATGTTATAATATQGGGGAAATNATTCATCaVACTCAATACCCTGNTGTHAACCSLCNCA^ : 660 



* 

TrMDHa : CGTTAAAATCTGCT : 674 



e 56 Deduced amino acid sequence of TrMDHa 

20 * 40 * 60 

TrMDHa : MAICDPVRVLVTOAA<»IGYAI.VPMIMiG\mi^PDQPVII^IP : 60 

* -80 * * 

■nrMDHa : AAPPIlLKGWATTOVVBACTaWIAVMV(3QFPIaCE6^ffiIUa)VMTKNVSIYKSQAS^^ : 120 

* 140 ■ * 160 * "0 
TrMDHa : AAANCKVLWMJPMmTJ^ILKBYJ^SIPBKNISALTRIJJHinU^ « 180 



* 200 
TrMDHa : NVIIWGKXFINSIPXCXPXNR : 201 



IQI&e 57 Nucleotide sequences of nucleic acid ' fragments contributing to the 
consensus contig sec[uence TrMDHa 



20 



40 



60 



TrMDHal 

TrMDHa2 

TrMDHaS 

TrMDHa4 

TrMDHaS 

TrMDHa6 

TrMDHaV 

TrMDHaS 

TrMDHa9 

TrMDHalO 

TrMDHall 



Gc|TCTTBAAAAcBcACTAAACTCTTTTCTATTGTTCTTATTTCT'IjCGATCTATTTCCA 

ScttcttqSaaachcactaaactcttttctattgttcttatttcttcgatctatttcc 
IcwtSaaaacncactaaactcttttctattgttcttatttctticgatctatttBcwI 
^caftaaactcttttwtattgttctt^tttcttcgatctatttcc 
scagtaaactcttttctattgttcttatttcttcgatctatttcc 
IcactBaactcttttctBttgttctt;^,tttcti7CGatcBatttcc 
itaaactcttwtctattgttcttljtttcttcgatctatttccj 
aaactcttttctattcttcttatttctlicgatctatttcca 



80 



100 



120 



TrMDHal 

trMDHa2 

TrMDHaS 

TrMDHa4 

TrMDHaS 

TrMDHa6 

TrMDHa? 

TrMDHaS 

TrMDHa9 

TrMDHalO 

■frMDHall 



atggccaaagacccagttcgtgttcttgtc|ctggtgctgcaggac/|a;qttgggtatgct 
atggccaaagacccagttcgtgttcttgtcactggtgctgcaggacaaattgggtatgct 
atggccaaagacccagttcgtgttcttgtcactggtgctgcaggacaaattgggtatgct 

ATGGCCAAAGACCCAGTTCGTGTTCTTGTCACTGGTGCTGCAGGACAAATTGGGTATGCT 
ATGGCCAAAGACCCAGTTCGTGTTCTTGTCACTGGTGCTGCAGGACriAlCTTGGGTATGCT 
ATGGCCAAAGACCCAGTTCGTGTTCTTGTCACTGGTGCTGCAGGAC^pi^GTTGGGTATGCT 

atggccl2u\gacccagttcgtgttcttgtcactggtgctgcaggacaaattgggta 

atggccft^gacccagttcgtgttcttgthgctggtgctgcaggacaaattgggtatgct 

atggccha?\gacccagttcgtgttcttgtbactggtgctgcaggaciag^ 

atggccHaagacccagttcgtgttcttgtcactggtgctgcaggacaaattgggtatgct 

atggccaaagacccagttcgtgttcttgtcactggtgctgcaggac4aattg ggtatgct 



140 



160 



180 



TrMDHal 

TrMDHa2 

TrMDHaS 

'trMDHa4 

TrMDHaS 

TrMDHa6 

TrMDHa? 

TrMDHaS 

TrMDHa9 

TrMDHalO 

TrMDHall 



CTTGTCCCTATGATTGCTAGGGGAGTGATGCTCGGgCCTGACCAiMaTGTG 

CTlSGTCCCTATGATTGCTAGGGGAGTGATGCTCGGCCCTGACCAGCaTGTGATCCTCCAC 

CTTGTCCCTATGATTGCTAGGGGAGTGATGCTCGGCCCTGACCAGCGTGTGATCCTCCAC 

CTTGTCCCTATGATTGCTAGGGGAGTGATGCTCGGCCCTGACCAGCqTGTGATCCTCCAC 

CTTGTCCCTATGATTGCTAGGGGAGTGATGCTCGGCCCTGACCAGCGTGTGATCCTCCAC 

CTTGTCCCTATGATTGCTAGGGGAGTGATGCTCGGCCCTGACCAGCGTGTGATCCTCCAC 

CTTGTCCCTATGATTGCTAGGGGAGTGATGCTCGGCCCTGACCAGCCTGTGATCCTCCAG 

CTTGTCCCTATGATTGCTAGGGGAGTGATGCTCGGCCCTGACCAGCQTGTGATCCTCCAC 

CTTGTCCCTATGATTGCTAGGGGAGTGATGCTCGGCCCTGACCAGCGTGTGATCCTCCAC 

CTTGTCCCTATGATTGCTAGGGGAGTGATGCTCGGCCCTGACCAGCGTGTGATCCTCCAG 

CTTGTCCCTATGATTGCTAGGGGAGTGATGCTCGGCCCTGACCAGCGTGTGATCCTCCAC 



200 



220 



240 



TrMDHal 

TrMDHa2 

TrMDHa3 

TrMDHa4 

TrMDHaS 

TrMDHa6 

TrMDHa? 

TrMDHaS 

TrMDHaS 

TrMDHalO 

TrMDHall 



ATGCTTGACATT-g;CACCTGj|AGL 

atgcttgacatgccacctgcagccgaatcactgaacggtgtgaaaat:jggagttggtggat 
atgcttgacattccacctgcagccgaatcactcaacggtgttaaaatggagttggtggat 

ATGCTTGACATTCCACCTGCAGCCGAATCACT§AACGGTGTTAAAA'I|GGAGTTGGTGGAT 
ATGCTTGACATTCCACCTGCAGCCGAATCACTCAACGGTGTTAAAA1|GGAGTTGGTGGAT 
ATGCTTGACATTCCACCTGCAGCCGAATCACTCAACGGTGTTAAAAT|GGAGTTGGTGGATi 
ATGCTTGACATTCCACCTGCAGCCGAATCACTCAACGGTGTTAAAA'IjGGAGTTGGTGGAT, 
ATGCTTGACATTCCACCTGCAGCCGAATCACTCAACGGTGTTAAAAT|GGAGTTGGTGGAT 
ATGCTTGACATTCCACCTGCAGCCGAATCACTCAACGGTGTTAAAAIIGGAGTTGGTGGAT 
ATGCTTGACATTCCACCTGCAGCCGAATCACTCAACGGTGTTAAAA1[GGAGTTGGTGGAT 
ATnrTTnzxrATTrnACCTGCAGCCGAATCACTCAACGGTGTTAAAAU GGAGTTGGTGGAT 




260 



280 



300 



TrMDHal 

TrMDHa2 

TrMDHa3 

TrMDHa4 

TrMDHaS 

TrMDHa6 

TrMDHa? 

•PrMDHaS 

TrMDHa9 

TrMDHalO 

TrMDHal 1 



TrMDHal 

TrMDHa2 

TrMDHa3 

TrMDHa4 

TrMDHaS 

TrMDHa6 

TxMDHa? 

TrMDHaS 

TrMDHa9 

TrMDHalO 

TrMDHall 



GCTGCATTCCCTCTTCTTAAAGGAGTTGTTGCTACgACTGATGTGGljTGAGGCATGCACT 
GCTGCATTCCCTCTTCTTAAAGGAGTTGTTGCTACAACTGATGTGGITGAGGCATGCACT 
GCTGCATTCCCTCTTCTTAAAGGAGTTGTTGCTACAACTGATGTGGTTGAGGCATGCACT 
GCTGCATTCCCTCTTCTTAAAGGAGTTGTTGCTACAACTGATGTGGTTGAGGCATGCACT 
GCTGCATTCCCTCTTCTTAAAGGAGTTGTTGCTACAACTGATGTGGTTGAGGCATGCACT 
GCTGCATTCCCTCTTCTTAAAGGAGTTGTTGCTACAACTGATGTGGIjTGAGGCATGCACT 
GCTGCATTCCCTCTTCTTAAAGGAGTTGTTGCTACAACTGATGTGGHTGAGGCATGCACT 
GCTGCATTCCCTCTTCTTAAAGGAGTTGTTGCTACAACTGATGTGGTjTGAGGCATGCACT 
GCTGCATTCCCTCTTCTTAAAGGAGTTGTTGCTACAACTGATGTGGTjTGAGGCATGCACT 
GCTGCATTCCCTCTTCTTAAAGGAGTTGTTGCTACAACTGATGTGGT|TGAGGCATGCACT 



* 


320 




* 


340 


* 


360 





GGgGTCAATATTGCCGTTATGGTTGG^'GGGTTCCCTAGAAAAGAAGGTATGGAGAGGAAA 
GGTGTCAATATTGCCGTTATGGTTGGTGGGTTCCCTAGAAAAGAAGGTATGGAGAGGAAA 
GGTGTCAATATTGCCGTTATGGTTGGTGGGTTCCCTAGAAAAGAAGGTATGGAGAGGAAA 
GGTGTCAATATTGCCGTTATGGTTGGTGGGTTCCCTAGAAAAGAAGGTATGGAGAGGAAA 
GGTGTNAATATTG|CGgTATGGNTGGWGGGTT|TCW.TA{|gASA| 
GGTGTCAATATTGCCGTTATGGTTGGTGGGTTCCCTAGAAAAGAAGGT^ 
GGTGTCAATATTGCCGTTATGGTTGGTGGGTTCCCTAGAAAAGAAGGTATGGAGAGGAAA 
GGTGTCAATATTGCCGTTATGGTTGGTGGGTTCCCTAGAAAAGAAGGTATGGAGAGGAAA 
GGTGTCAATATTGCCGTTATGGTTGGTGGGTTCCCTAGAAAAGAAGGTATGGAGAGGAAA 
GGTGTCAATATTGCCGTTATGGTTGGTGGGTTCCCTANAAAAGAAl^GTATGGAGAGGAAA 



299 
297 
297 
295 
2 94 
284 
283 
279 
280 
281 



359 
357 
357 
355 
344 
344 
343 
339 
340 
341 



380 



400 



420 



TrMDHal 

TrMDHa2 

TrMDHa3 

TrMDHa4 

TrMDHaS 

TrMDHae 

TrMDHa7 

TrMDHaS 

TrMDHa9 

TrMDHalO 

TrMDHall 



GATGTGATGACTUU^AAATGTCTCTATTTACAAGTCTCAGGCTTCTGGCCTTGAAAAACAT 
GATGTGATGACAAAAAATGTCTCTATTTACAAGTCTCAGGCTTCTGCCCTTGAAAAACAT 
GATGTGATGACAAAAAATGTCTCTATTTACAAGTCTCAGGCTTCTGOCCTTGAAAAACAT 
GATGTGATGACAAAAAATGTCTCTATTTACAAGTCTCAGGCTTCTGQCCTTGAAAAACAT 



gatgtgatgacaaaaaatgtctctatttacaagtctcaggcttctggccttgaaaaacat 
gatgtgatgacaaaaaatgtctctatttacaagtctcaggcttctggccttgaaaaacat 
gatgtgatgacaaaaaatgtctctatttacaagtctcaggcttctgcccttgaaaaacat 
gatgtgatgacaaaaaatgtctctatttacaagtctcaggcttctgqccttgaaaaacat 
gatgtgatgacBaaaaatgtctctatttacaagtctSaSgcttStgSccttgaaaaacat 



440 



460 



480 



TrMDHal 

TrMDHa2 

TrMDHa3 

TrMDHa4 

TrMbHa5 

TrMDHae 

TrMDHa7 

TrMDHaS 

TrMDHa 9 

TrMDHalO 

TrMDHall 



gctgctgcaaactgcaaggttcttgttgttgccaacccagcaaacagcaatgcattgatc 
gctgctgcaaactgcaaggttcttgttgttgccaacccagcaaacagcaatgcattgatc 
gctgctgcaaactgcaaggttcttgttgttgccaacccagcaaacaccaatgcattgatc 
gctgctgcaaactgcaaggttcttgttgttgccaacccagcaaacagcaatgcattgatc 



gctgctgcaaactgcaaggttcttgttgttgccaacccagcaaacac'caatgcattgatc 
gctgctgcaaactgcaaggttcttgttgttgccaacccagcaaacaoctuvtgcattgatc 
gctgctgcaaactgcaaggttcttgttgttgccaacccagcaaacagcaatgcattgatc 
gctgctgcaaactgcaaggttcttgttgttgccaacccagcaaacaccaatgcattgatc 
gctgctgcaaactgcaaggttcttgttgttgncaacccaScaaacagcaatgcattgatc 




TrMDHal 

TrMDHa2 

TrMDHa3 

TrMDHa4 

TrMDHaS 

TrMDHaS 

TrMDHa? 

TrMDHaS 

TrMDHaS 

TrMDHalO 

TrMDHall 



500 



520 



540 



TTGAAGGAATATGCTCCATCCATTCCTGAGAAAAACATTTCTGCTTT|GACTAGATTGGAC; 
TTGAAGGAATATGCTCCATCCATTCCTGAGAAAAACATTTCTGCTTTpACTAGATTGGAC 
TTGAAGGAATATGCTCCATCCATTCCTGAGAAAAACATTTCTGCTTTpACTAGATTGGAC 
TTGAAGGAATATGCTCCATCCATTCCTGAGAAAAACATTTCTGCTTT^GACTAGATTGGAC 



TTGAAGGAATATGCTCCATCCATTCCTGAGAAAAACATTTCTGCTTTjGACTAGATTGGAC! 
TTGAAGGAATATGCTCCATCCATTCCTGAGAAAAACATTTCTGCTT1|GACTAGATTGGAC 
TTGAAGGAATATGCTCCATCCATTCCTGAGAAAAACATTTCTGCTTT|GACTAGATTGGAC 
TTGAAGGAATATGCTCCATCCATTCCTGAGAAAAACATTTCTGCTTT|GACTAGATTGGAC 
IXTGAAGGAATATGCTCCATNCATTCCTGAHAAAAACATTTlTGCTT'IiGACTAGATTGGAC 



560 



580 



600 



TrMDHal 

'TrMDHa2 

TrMDHa3 

TrMDHa4 

TrMDHaS 

TrMDHa6 

TrMDHa? 

TrMDHaS 

TrMDHaS 

TrMDHalO 

TrMDHall 



CATAACAGGGCACT^GGTCA 
CATAACAGGGCACTAGGTCAAATTTCTGAA 
CATAACAGGGCACTAGGTCAAATTTCTGAAAGACTAAA 
CATAACAGGGCACTAGGTCAAATTTCTGAAAGACTAAACGTTGAAGTjTTCTGATGT GAAA 



CATAACAGGGCACTAGGTCAAATTTCTGAAAGACL 
CATAACAGGGCACTAGGTCAAATTTCTGAAAGACTAAACGTTGAAGTTTCTGATGTGAAA 
CATAACAGGGCACTAGGTCAAATTTCTGAAAGACTAAACGTTGAAGljTTCTGATGTGg 
CATTVACAGGGCACTAGGTCAAATTTCTGAAAGACTAAACGTTGTIAGTTTCTGATGTGAAA 
CATAACAGGGCACTAGGSCAAATTTiTGAAASfACTAAACGTTGAAGllTTlTGATGTGAAA 



TrMDHal 

TrMDHa2 

TrMDHa3 

TrMDHa4 

TrMDHaS 

TrMDHae 

TrMDHa? 

TrMDHaS 

TrMDHaS 

TrMDHalO 

TrMDHall 



620 



640 



660 



AATGTTAT; 



AATGTTATAATg:TGGG 



AATGTTATAATgTG 

TVATGTTATHATATGGGGGAAATNATTCATCAACTCAATACCCTGNTGTNAACCACISrCAAC 



I 



TrMDHal 
TrMDHa2 
TrMDHa3 
TrMDHa4 
TrMDHaS 
TrMDHa6 
TrMDHa? 
TrMDHaS 
TrMDHaS 
TrMDHalO 
TrMDHall 



CGTTAAAATCTCCT 



653 




e 58 ConeensuB contig nucleotide seiiuence of • TrMDHb 



« 20 * 40 . * 60 

TrMDHb : TTCTCCO^AATCHNGAAMiaSCCCANACftTa^CAACATAA : 60 

* 80 * 100 ■ * 120 
TrMDHb : TCTCTNAACAAWVACTGTTOTCCTCTCTTAATCTTCCCTGTTCOATTCCTTCCATTTCT : 120 

■ * * 160 ' * ■ 180 

TrMDHb : TCAAAAATGGCCAAAGACCCAGTTCGTQTTCTCGTOVCTCSGTQC?^ : • 180 

* 200 * 220 * 240 
TrMDHb : TATGCACTTGTCCCTATGATTGCTAGGGGafiTGATGCTTOGTCCTGATCAACCTGTGATC s 240 



* 280 * 



TrMDHb CTTCACaVTQCTTGATATTCCTCCAGCAGCAGAGTO^TTGAATCQAGTTAAGATGGAGTTG ; 300 

>. * 320 * 340 * 360 

TrMDHb : GTCGATGCTGCATTTCCACTTCTTAAAGGTGTTGTTGCTACAACTGATOTTGTTGAAGCA s 360 

380 .* 400 * 420 ■ 

TrMDHb : TGCACTGGAGTCAATATTGCAGTCATGGTTGGTOGATTCCCAAGAAAAGAAGGTATGGAG : 420 

* 440 * 460 * 480 
TrMDHb ; WSGAAGGATGTGATGTCTAfiGaACGTCTCrATTTACAAGTCCCAGGCTTCTGCC^ : 480 

* 500 '* 520 • * 540 
TrMDHb : AAGCaVTGCTGCTGCCAACTGaUVGGTTTTGGTTGTTQCTAACCCSWKaUU^ : 540 

* 560 * 580 . * • 600 
TrMDHb : TTGATCTT6AAGGAATTTGCTCCATCTATTCCAGAGAAAAACATTTCTTGTTTGACTAGA : 600 

• * 620 * 640 * 660 

•^MDHb : CTTQATCACAACAOGGCATTGQGCCaAATTTCTaAAAGATTGAATGTTCAAGTTTC^ : 660 

* 680 * ■'OO * 720 
TrMDHb : GTAAAGAATGTCATTATCTGGGGTAATCATTO^TCAACTCAGTATOTTGATGTCAACCA^ : 720 



* 760 * "^80 



TrMDHb : GCAACTGTtLcACCCCCGCTGGGGAGAAGTCTGTCCGTGAGC^TTTCTGATGA^ : 780 

* 800 * 820 * 840 
TrMDHb : TGGTTGAATGGAGAATTO^TATCTACCGTTC^ACAACGTGGTGCTGCAATTATTAAGGCT 840 

* 860 * • 880 * 900 
TrMDHb : MAAAGCTTTCAAGCGCACTATCOSCTGCTAGCGCTGCTTGCGAC^ : 900 

* 920 * * 

TrMDHb : GTTCTTGGAlcTCCCa^GG6CACCTTCGTTTCAATGGG2«STGTATTCTGATGOTTC^ : 960 




980 1000 * 1020 

TrMDHb : AACG' 



[TACCAGCTGGACTCATCTATTOVTTCCCTGTCACCACTGCTAA^ : 1020 



* 1040 * 1060 * 1080 
TrMDHb : ATTGTTCAAGGACTTTCSVATTGACGAGTTCTa^GGAAG^^ 1080 

* 1100 • * 
TrMDHb : GAGTTATCCGAGGAAAAGAGTTTGGCATACT : 1111 



^^Le 59 Deduced amino acid sequence of TrMDHb 

* 20 * 40 * 60 
TrMDHb : MAKDPVRVIiVTGJUlGQIGYALVPMIARGVMLGPDQPVIL^ s 60 

* 80 * 100 * 120 
TrMDHb : AAFPIiLKGWATTDVVBACTGWIAVMVGGPPIOCBGMBRKDVMSKlW : 120 

* 140 * 160 * 180 
TrMDHb : AAANCKVIiWANPAimrALILKEPAPSIPEKlTISCLTRljDH^ ' 180 



★ 200 * 220 * 240 

TrMDHb : NVIIWGWHSSTQYPDViraATViraPAGEKPVREIjVSDDAWLNQEFISTVQQRG^^ : 240 



* 260 * 280 * . 300 

TrMDHb : LSSALSAASAACDHIRDWVIiGTPQGTFVSMGVYSDGSyNVPAGIilYSFPWTANGBWKI^ : 300 



* 320 

TrMDHb : QGLSXDEFSRKKLDItTAEBLSEEKSIjAY : 328 




Ire 60 Nucleotide sequences of nucleic acid fragments contributing to the 
consensus contig sequence TrMDHb 



TrMDHbl 
TrMDHb2 
TrMDHb3 
TrMDHb4 
TrMDHbS 
TrMDHbe 
TrMDHb? 
TrMDHbS 
±rMDHb9 
TrMDHbl 0 
TrMDHbl 1 
TrMDHb 12 
TrMDHb 13 
TrMDHbl4 
TrMDHblS 
TrMDHb 16 
TrMDHblV 
J TrMDHbl 8 
TrMDHbl9 
TrMDHb20 
TrMDHb21 
TrMDHb22 
TrMDHb23 
TrMDHb24 
TrMDHb25 
TrMDHb26 
TrMDHb27 
TrMDHb28 
TrMDHb29 
TrMDHb30 
TrMDHbS 1 
TrMDHbS 2 



20 



TTCTCCC-gNAATCNNGAAANC 
TTCTCNCAIvFAATCN NGA AANC 



40 



60 





TrMDHbl 
TrMDHb2 
TrMDHb3 
TrMDHb4 
TrMDHbS 
TrMDHb6 
TrMDHb? . 
TrMDHbS 
TrMDHbS 
TrMDHb 10 
TrMDHbll 
TrMDHbl2 
TrMDHbl3 
TrMDHbl4 
O^MDHblS 
TrMDHbl 6 
TrMDHbl? 
TrMDHb 18 
TrMDHbl 9 
TrMDHb20 
TrMDHb21 
•jTrMDHb22 
'TrMDHb23 
TrMDHb24 
TrMDHb25 
TrMDHb26 
TrMDHb27 
TrMDHb28 
TrMDHb29 
TrMDHbS 0 
TrMDHbS 1 
TrMDHbS 2 



BAApSAAAACTGTTCTTCCTCTCTTAATCTTCCCTGTTffiGATjTCCTTCCA,^^^ 
lAAfelAAAACTGTTCTTCCTCTCTTAATCTTCCCTGTTgjGATTCCTTCCAiTTCT 
jAACAAAAACTGTTCTTCCiCTCTTAATCTTCCCTGTTCGATTCCTTCgATTTCT 
"SAACAAASGCT^TTCTTCgTCTCTTAATCTTC^f^ 




'^S^flH^^^^^^^AAACT;ATTCTTS^^ 

TCTPfwAACAAAAACT^TTCTTC^TCTCTTAATCTTC^CgGTTCGA 
TclJXjBAACAAAAACTGTTCTTCC^^ 

TCTCr|gAACHAAAACTGgTCTTCCTCTCTTAAHCTTCCCTGTTCGAT|TCCTTCCA|TTCT 

TCTCTNAAcjjAAAACTGTTCTTCCTCTCTTAATCTTCCCTGTTjfGAT|TCCTTCCAiGTT 

TCTCTBgAj|©AAAACTGTTCTTCCTCTCTTj|ATCTTCCCTGTTCGAT|TCCTTCC;^ 

TCTH|iAAA^AAAACT§TTCTTB|TCTCTTAATCTTC^^^ 

TwSSlAApAAAAACTGTTCTTCClCTCTTAATCTTCC 

tctctBaaa:Saaaactgttcttcctctcttaatcttccctgttcgattccttcca|ttct 

tctctnaacpaaaactgttcttcctctctt§atcttccctgttggat|tccttccbct 

tcte|^caaaaactgttcttcbictcttaatcttccctgttcgat|tccttcgatt 

tctct|aacBa?^aactgttcttcctctcttaatcttccctgtt|gaT|TccttccBgttct 

tctH§naacaaaaactgttcttcB1ctcttaatcttccctgttcgaT|Tcct^ 

gCTCTMAACAAAAACTGTTCTTCCTC§CTTpATCTTCCCTGTT§GAT|TCCTTCCHGTTCT 

'"™"^aacaaaaactgttcttcb8ctctt^atcttccctgttcgat|tccttc|m 

caaaaactgItcttcctctcttoIatcttccctgttcgatiTCCttccBIttct 
ctgttcttcctctcttjsatcttccctgttcgatjtccttccblttct 
[|^ttcttcctctctt^JaPcttccctgtt|gattccttccaSttct 

GTTC TTCBiCTCTTAglfcTTCCCTGTTCGAT TCCTTC§ATTTCT 
^^TTCcfcTCTTgmTCTTCCCTGTTffiGATTCCTTCCBiTTCT 
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TrMDHb2 

TrMDHb3 
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TrMDHbS 
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TrMDHbS 

TrMDHb 9 

TrMDHbl 0 

TrMDHbll 

TrMDHbl2 

TrMDHbl3 

TrMDHbl4 

TrMDHb 15 

TrMDHbie 

TrMDHbl? 

TrMDHbie 

TrMDHbl9 

TrMDHb20 

TrMDHb21 

TrMDHb22 

TrNDHb23 

TrMDHb24 

TrMDHb25 

TrMDHb2 6 

TrMDHb27 

TrMDHb28 

TrMDHb29 

TrMDHb30 
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TrMDHb32 



tcaaaaatggccaaagacccagttcgtgttctcgtcactggtgctgcagggcaaattggt 

tcaaaaatggcc/^jy^gacccagttcgtgttctcgtcactggtgctgcagggcaaatt^ 

tcaaaaatggccaaagacccagttcgtgttctcgtcactggtgctgcaggScaaattggt 

TCAGgAATGGCCAAAGACCCAGTTCGTGT|CTCGT|ACTGGTGCTGCAGGgCAAAT^ 

tcaaaaatggccaaagacccagttcgtgttctcgtcactggtgctgcagggcaaattggt 

f-Eprr? n3 ^ . I ry 



m fitr* T — m 

tcabgaatggccaaagacccagttcgtgtgctcgtgactggtgctgcaggp!caaattggt 

tcaaaaatggccaaagacccagttcgtgttctcgtcactggtgctgcagg|!caaattggt 

tcaaaaatggccnaagacccagttcgtgttctcgtcactggtgctgc&gggcaaattggt 

tcaaaaatggcciaagacccagttcgtgttctcgtcactggtgctgc^gggcaaattggt 

tcaaaaatggccaaagacccagttcgtgttctcgtcactggtgctgcagggcaaattggt 

tca^5g;aatggcc7iaagacccagttcgt§t^]ctcgt|bactggtgctgck^ 

tcaaaaatggccaaa^acccagttcgtgttctcgtcactggtgctgcagg;^^^ 

tcaaaaatggccaaagacccagttcgtgttctcgtcactggtgctgcagg'gcaaattggt 

tcaaaaatggccaaagacccagttcgtgttctcgtcactggtgctgcagggcaaattggt 

;tcaaaaatggccaaagacccagttcgtgttctcgtcactggtgctgcagg||caaattggt 

jtcaaaaatggccaaagacccagttcgtgttctcgtcactggtgctgcagggcaaattggt 

jtcaaaaatggccaaagacccagttcgtgttctcgtcactggtgctgckggjscaaattggt 

tcaaaaatggccaaagacccagttcgtgttctcgtcactggtgctgckggjgcaaattggx 

tcaaaaatggccaaagacccagttcgtgttctcgtcactggtgctgcagggcaaattggtI 

tcaaaaatggccaaagacccagttcgtgttctcgtcactggtgctgcagggcaaattggt| 

tbwaaaatggccaaagacccagttcgtgttctcgtcactggtgctgckgggcaaattggti 

tcaaaaatggccaaagacccagttcgtgttctcgtcactggtgctgckgggcaaattggt| 

tcaaaaatggccaaagacccagttcgtgttctcgtcactggtgctgckgggcaaattggt 

THwAAAATGGCCAAAGACCCAGTTCGTGTTCTCGTCACTGGTGCTGckGG§CAAATTGGT 
TcBAAAATGGCcHAAGACCCAGTTCGTGTTCTCGTCACTGGTGCTGckGGGCAAATTGGT 




TrMDHbl 
TrMDHb2 
TrMDHb3 
TrMDHb4 
TxMDHbS 

■ErMDHb? 
T^MDHbS 
TrMDHb9 
TrMDHbl 0 
Tr:MDHbll 
TrMDHbl2 
TrMDHbl3 
TrMDHbl4 
TrMDHblS' 
TrMDHbie 
TrMDHbl? 
TrMDHblS 
•ft:MDHbl9 
•I?rMDHb20 
TrMDHb21 
. lirMDHb22 
' TrMDHb23 
TrMDHb24 
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TrMDHb26 
TrMDHb27 
TrMDHb28 
TrMDHb29 
TrMDHbSO 
TrMDHb31 
TrMDHb32 



TATGCACTTGTCCCTATGATTGCTAGGGGAGTGATGCTTGGTCCTGAjTCAACCTGTGATC 
TATGCACTTGTCCCTATGATTGCTAGGGGAGTGATGCTTGGTCCTGATCAACCTGTGATC 

tatgcacttgtccctatgattgctaggggagtgatgcttggtcctgaItcaacctgtgatc 
tatgcacttgtccctatgattgctaggggagtgatgcttggtcctgatcaacctgtgatc 
tatgcacttgtccctatgattgctaggggagtgatgcttggtcctgatcaacctgtgatc 
tatgcacttgtccctatgattgctaggggagtgatgcttggtcctgatcaacctgtgatc 
tatgcacttgtccctatgattgctaggggagtgatgcttggtcctgatcaacctgtgatc 

TATGCACTTGTCCCTATGATTGCTAGGGGAGTGATGCTTGGTCCTGAjTCAACCTGTGATC 

tatgcacttgtccctatgattgctaggggagtgatgcttggtcctgatcaacctgtgatc 

TATGCACTTGTCCCTATGATTGCTAGGGGAGTGATGCTTGGTCCTGATCAACCTGTGATC 

tatgcacttgtccctatgattgctaggggagtgatgcttggtcctgatcaacctgtgatc 

TATGCACTTGTCCCTATGATTGCTAGGGGAGTGATGCTTGGTCCTGATCAACCTGTGATC 

tatgcacttgtccctatgattgctaggggagtgatgcttggtcctgatcaacctgtgatc 

TATGCACTTGTCCCTATGATTGCTAGGGGAGTGATGCTTGGTCCTGATCAACCTGTGATC 

tatgcacttgtccctatgattgctaggggagtgatgcttggtcctgatcaacctgtgatc 
tatgcacttgtccctatgattgctaggggagtgatgcttggtcctgatcaacctgtgatc 

TATGCACTTGTCCCTATGATTGCTAGGGGAGTGATGCTTGGTCCTGAjTCAACCTGTGATC 
TATGCACTTGTCCCTATGATTGCTAGGGGAGTGATGCTTGGTCCTGATCAACCTGTGATC 

tatgcacttgtccctatgattgctaggggagtgatgcttggtcctgatcaacctgtgatc 

TATGCACTTGTCCCTATGATTGCTAGGGGAGTGATGCTTGGTCCTGAjTCAACCTGTGATC 

tatgcacttgtccctatgattgctaggggagtgatgcttggtcctgatcaacctgtgatc 
tatgcacttgtccctatgattgctaggggagtgatgcttggtcctgatcaacctgtgatc 
tatgcacttgtccctatgattgctaggggagtgatgcttggtcctgatcaacctgtgatc 

TATGCACTTGTCCCTATGATTGCTAGGGGAGTGATGCTTGGTCCTGATCAACCTGTGATC 
TATGCACTTGTCCCTATGATTGCTAGGGGAGTGATGCTTGGTCCTGAjTCAACCTGTGATC 
TATgCACTTGTCCCTATGATTGCTAGGGGAGTGATGCTTGGTCCTGA.TCAACCTGTGATC 
TATGCACTTGTCCCTATGATTGCTA GGGGAGTGATGCTTGGTCCTGATCAACCTGTGATC 
■■^^^'>^^'^WM^BBMMGGGGAGTGATGCTTGGTCCTGAf^H^TACCTGTGATC 
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TrMDHbl 
TrMDHb2 
TrMDHbS 
frMDHb4 
':FrMDHb5 
TrMDHb6 
TrMDHb? 
TrMDHbS 
TrMDHb9 
TrMDHbl 0 
TrMDHbll 
TrMDHbl2 
TrMDHbia 
TrMDHbl4 
TrMDHblS 
TrMDHbl 6 
TrMDH]3l7 
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TrMDHb20 
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cttcacatgcttgatatPjCctccagcagcagagtcattgaatggagttaaSatggagttg 
cttcacatgcttgatat|gcctccagcagcagagtcattgaatggagttaa$^atggagttg 

CTTCACATGCTTGATATTCCTCCAGCAGCAGAGTCATTGAATGGAGTTAAGATGGAGTTG 
CTTCACATGCTTGATAT[§CCTCCAGCAGCAGAGgCATTGAATGGAGTTAAgATGGAGTTG 
CTTCACATGCTTGATATfejcCTCCAGCAGCAGAGTCATTGAATGGAGTTATii^ 

cttcacatgcttgatat||cctccagcagcagagtcattgaatggagttaj^tggagttg 

CTT,gACATGCTTGATATTgCT|CAGmGNAGAGTiATTGAATGGAG|TAA|ATGGAG|TG 
CT§CACATGCTTGATATTCC§CCP:GCAG*CAGAGTCATTGAATGGAGT 

cttcacatgcttgatattcctccagcagcagagtcattgaatggagttaagatggagttg 
ctScacatgcttgatattccIccSIgcagcagagtcattgaatggagttaagatggagttg 
cttcacatgcttgatat^cctccagcagcagagtcattgaatggagttaaJSatggagttg 

CTTCACATGCTTGATATTCCTCCAGCAGCAGAGTCATTGAATGGAGTTAAyATGGA.GTTG 
CTp;CACATGCTTGATATTCC'§CCpGCAGCAGAGTCATTGAATGGAGTTAAGATGGAGTTG 
CT'^CACATGCTTGATATTCC'^CCgGCAGCAGAGTCATTGAATGGAGTTTiAGATGGAGTTG 
CTTCACATGCTTGATATTCCTCCAGCAGCAGAGTCATTGAATGGAGTTAA^TGGAGTTG 
CTTCACATGCTTGATATTCCTCCAGCAGCAGAGTCATTGAATGGAGTTAAGATGGAGTTG 

cttcacatgcttgatattcctccagcagcagagtcattgaatggagttaaSatggagttg 
cttcacatgcttgatattcctccagcagcagagtcattgaatggagtta?^tggagttg 
cttcacatgcttgatattcctccagcagcagagtcattgaatggagttaagatggagttg 
cttcacatgcttgatattcctccagcagcagagtcattgaatggagtttaaSatggagttg 
ctpcacatgcttgatattccacc^jgcagcagagtcattgaatggagttaagatggagttg 

CTiicACATGCTTGATATTCC^CclqGCAGCAGAGTCATTGAATGGAGTTAAGATGGAGTTG 

cttcacatgcttgatattcctgcagcagcagagtcattgaatggagttaagatggagttg 
cttcacatgctxgatattcctccagcagcagagtcattgaatggagttaaSatggagttg 

CTTCACATGCTTGATATTCgTCCAGCAGCAGAGTCATTGAATGGAGTTAAGATGGAGTTG 
CTTBgCATGCTTG ATATgCCTCCAGCAGCAGAGTCATTGAATGGAGTTAA§ATGGAGTTG 
^rfjjjBHSSSJH^T ATTCCTrac|G C AGCg^ 
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251 
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TrMDHbl 
TrMDHfa2 
TrMDHbS 
TrMDHb4 
TrMDHbS 
TrMDHb6 
TrMDHb? 
TrMDHbS 
TrMDHbS 
TrMDHbl 0 
trMDHbll 
TrMDHbl2 
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TrMDHbl4 
TrMDHblS 
TrMDHb 16 
TrMDHbl7 
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GTCGATGCTGCATTTCCACTTgTTAj^GGTGgTGSTG|jTACAACTG7iTG^:TG^T^ 
GT^GATGCTGCATTTCCACTTCTTAAAGGTGTTGTTGCTACAACTG/iTGTTGTTGAAGCA 
GTyGATGCTGCATTTCCACTTCTTAAAGG^;GTTGTTGCTACAACTGziTGTTGTTGAAGCA 
GTCGATGCTGCATTTCCACTTCTTAAAGGTGTTGTTGCTACAACTG^iTGTTGTTGAgGCA 
GTj§GATGCTGCATTTCCACTTCTTAAAGG^GTTGTTGCTACAACTGi^TGTTGTTGAAGCA 
G^GGATGCTGCATTTCCACTTCTTAAAGGG^ 
GT|GGATGCTGCATTTC^ACTTCTTAAAGGd^ 
GG^GATGCTGNATTNK^ 

gtcgatgctgcatttccacttcttaaaggtgttgto 

gtcgatgctgcatttccacttcttaaaggtgttgttgctacaactgtJtgttgttgaggca 
gtcgatgctgcatttccacttcttaaaggtgttgttgctacaactg^JtgttgttgaSgca 
gt^^gatgctgcatttccacttcttaaaggggttgttgctacaactgalxgttgttgaagca 
gtggatgctgcatttccacttcttaaaggtgttgttgctacaactg;iltgttgttgaa 
gtcgatgctgcatttccacttcttaaaggtgttgttgctacaactgatgttgttga^jgca 
gtcgatgctgcatttccacttcttaaaggtgttgttgctacaactgatgttgttgagpca 

GT.gGATGCTGCATTTCCACTTCTTAAAGGTGTTGTTGCTACAACTG?iTGTTGTTGAAGCA 

gtcgatgctgcatttccacttcttaaaggtgttgttgctac?lactg7^tgttgttgaSgca 
gt'sgatgctgcatttccacttcttaaaggtgttgttgctacaactgtiltgttgttgaagca 
gtggatgctgcatttccacttcttaaaggtgttgttgctacaactgi^tgttgttgaagca 
gtcgatgctgcatttccacttcttaaaggtgttgttgctaca^ctgaItgttgttgaSgca 
gtggatgctgcatttccacttcttaaaggtgttgttgctacaactgjiltgttgttgaagca 

GTCGATGCTGCATTTCCACTTCTTAAAGGTGTTGTTGCTACAACTG?JtGTTGTTGAGjGCA 

gtcgatgctgcatttccacttcttaju\ggtgttgttgctacaactga!tgttgttgaggca 
gtcgatgctgcatttccacttcttaaaggtgttgttgctacaactgaltgttgttgaggca 
gtggatgctgcatttccacttcttaaaggtgttgttgctacaactgaltgttgttgaj^gca 
gtcgatgctgcatttccacttcttaaaggtgttgttgctacaactgj^tgttgttgaSgca 
gtggatgctgcatttccacttcttaaagg:g|ttgttgctacaactg;^tgttgttgaagca 
gtcgatgctgcatttccacttcttaaaggtgttgttgctacaactga'tgttgttga^Sgca 
gtcgatgctgcatttccBcttcttaaaggtgttgttgctacaactgaItgttgttgaggca 




TrMDHbl 



TrMDHb2 - 
*rMDHb3 
TrMDHb4 
TrMDHbS 
TrMDHbS 
TrMDHbV 
TrMDHbS 
TrMDHb9 
•TrMDHbl 0 
TrMDHbll 
TrMDHbl2 
TrMDHbl3 
TrMDHbl4 
T?rMDHbl5 
TrMDHbl6 
TrMDHblV 
TrMDHblS 
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TGCACTGGAGTCAATATTGCAGTCATGGTTGGTGGATTCCCAAGAAAAGAAGGTATGGAG 
TGCACTGGAGTCAATATTGCAGTCATGGTTGGTGGATTCCCAAGAAAAGAAGGTATGGAG 
TGCACTGGAGTCAATATTGCAGTCATGGTTGGTGGATTCCCAAGAAAAGAAGGTATGGAG 
TGCACTGGAGTCAATATTGCAGTCATGGTTGGTGGATTCCCAAGAAAAGAAGGTATGGAG 
TGCACTGGAGTCAATATTGCAGTCATGGTTGGTGGATTCCCAAGAAAAGAAGGTATGGAG 
TGCACTGGAGTCAATATTGCAG TCATGGTTGGTGGATTCCCAAGAAAAGT^GGTATGGAG 

TGC'GCTGpAB3G§ATAT|G|c]b?NgBBBBSSDCE^ 

TGCACTGGAGTCAATATlgGCAGTCATGGTTGGTGGATTCCCAAG 

TGCACTGGAGTCAATATTGCAGTCATGGTTGGTGGATTCCCAAGAAAAGAAGGTATGGAG 

TGCACTGGAGTCAATATgGCAGTCATGGTTGGTGGATTCCCAAGAAAAGAAGGTATGGAG 

TGCACTGGAGTCAATATTGCAGTCATGGTTGGTGGATTCCCAAGAAAAGAAGGTATGGAG 

TGCACTGGAGTCAATATTGCAGTCATGGTTGGTGGATTCCCAAGAAikGAAGGTATGGAG 

TGCACTGGAGTCAATATpGCAGTCATGGTTGGTGGATTCCCAAGAAijAGAAGGTATGGAG 

TGCACTGGAGTCAATAT|GCAGTCATGGTTGGTGGATTCCCAAGAA2jAGAAGGTATGGAG 

TGCACTGGAGTCAATATTGCAGTCATGGTTGGTGGATTCCCAAGAAAAGAAGGTATGGAG 

TGCACTGGAGTCAATATTGCAGTCATGGTTGGTGGATTCCCAAGAAAAGAAGGTATGGAG 

TGCACTGGAGTCAATATTGCAGTCATGGTTGGTGGATTCCCAAGAATjjAGAAGGTATGGAG 

TGCACTGGAGTCAATATTGCAGTCATGGTTGGTGGATTCCCAAGAAAAGAAGGTO^ 

TGCACTGGAGTCAATATTGCAGTCATGGTTGGTGGATTCCCAAGAAAAGAAGGTATGGAG 

TGCACTGGAGTCAATATTGCAGTCATGGTTGGTGGATTCCCAAGAAAAGAAGGTATGGAG 

TGCACTGGAGTCAATATiGGCAGTCATGGTTGGTGGATTCCCAAGAAi^iAGAAGGTATGGAG 

TGCACTGGAGTCAATAT^GCAGTCATGGTTGGTGGATTCCCAAGAAisb\GAAGGTATGGAG 

TGCACTGGAGTCAATATTGCAGTCATGGTTGGTGGATTCCCAAGAA^jAGAAGGTATGGAG 

TGCACTGGAGTCAATATTGCAGTCATGGTTGGTGGATTCCCAAGAAjjAGAAGGTATGGAG 

tgcactggagtcaatattgcagtcatggttggtggattcccaagaapIagaaggtatggag 

TGCACTGGAGTCAATATTGCAGTCATGGTTGGTGGATTCCCAAGAAJ^GAAGGTATGGAG 

tgcactggagtcaatattgcagtcatggttggtggattcccaagtvaaagaaggtatggag 

TGCACTGGAGTCAATATTGCAG TCATGGTTGGTGGATTCCCAAGAAAAGAAGGTAT GGAG 
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AGGAAGGATGTGATGTCTAAGAACGTCTCTATTTACAAGTCCCAGGGTTCTGCCCTTGAA 
AGGAAGGATGTGATGpCTAAGAASGTCTCTATTTACAAGTCCCAGGCjTTCTGCCCTTGAA 
AGGAAGGATGTGATG!pCTAAGAAT|GTCTCTATTTACAAGTCCCAGGGTTCi§GCCCTTGAA 
AGGAAGGATGTGATG&TAAGAAOjGTCTCTATTTACAAGTCCCAGGqTTCTGCCCTTGAA 



AGGAAGGATGTgaATGTCTAAGAACGTCTCTATTTACAAGTCCCAgGGTTCTGCCCTTGAA 

AGGAAGGATGTGATGTCTAAGAACGTCTCTATTTACAAGTCCCAGGCjTTCTGCCCTTG/iA 

AGGAAGGATGTgATGTCTAAGAACGTCTCTATTTACAAGTCCCAgGCjTTCTGCCCTTGAA 

AGGAAGGATGTGATGjSCTAAGAApjGTCTCTATTTACAAGTCCCAGGCjTTCTGCCCTTGAA 

AGGAAGGATGTGATG|cTAAgAAiGTCTCTATTTACAAGTCCCAGGCjTTCTGCCCTTGAA 

AGGAAGGATGX^|ATGTCTAAGAACGTCTCTATTTACAAGTCCCaS^GC|TTCTGCCCTT 

AGGAAGGATGT[DATGTCTAAGAACGTCTCTATTTACAAGTCCCA^GCjTTCTGCCCTTGAA 

AGGAAGGATGTGATG^CTAAGAAffjGTCTCTATTTACAAGTCCCAGGGTTCTGCCCTTGAA 

AGGAAGGATGTGATGTCTAAGAACGTCTCTATTTACAAGTCCCAGGC|TTCTGCCCTTGAA 

AGGAAGGATGTGATG!SCTAAGAA|teTCTCTATTTACAAGTCCCAGGCjTTCTG 

AGGAAGGATGTGATGicTAAGA^TGTCTCTATTTACAAfi'SOT 

AGGAAGGATGTGATGTCTAAGAACGTCTCTATTTACAAGTCCCAGGCi^ 

AGGAAGGATGTGATG§CTAAGAAgGTCTCTATTTACAAGTCCCAGGC|TTCTGCCCTTGAAj 

AGGAAGGATGTSATGTCTAAGAACGTCTCTATTTACAAGTCCCASGCjTTCTGCCCTTGAA 

AGGAAGGATGtP|aTGTCTAAGAACGTCTCTATTTACAAGTCCCA^GC|TTCTGCCCTTGAA 

AGGAAGGATGTGATGTCTAAGAACGTCTCTATTTACAAGTCCCAGGCjTTCTGCCCTTGAA 

AGGAAGGATGTGATG§CTAAGAA§GTCTCTATTTACAAGTCCCAGGCp:TCTGCCCTTGAA 

AGGAAGGATGTGATGTCTAAGAACGTCTCTATTTACAAGTCCCAGGCjTTCTGCCCTTGAA 

AGGAAGGATGTGATG;§CTAAGAASGTCTCTATTTACAAGTCCCAGGCjTTCTGCCCTTGAA 

AGGAAGGATGTGATGTCTAAGAACGTCTCTATTTACAAGTCCCAGGCjTTCTGCCCTTGAA 

AGGAAGGATGTGATGTCTAAGAACGTCTCTATTTACAAGTCCCAGGCjTTCTGCCCTTGAA 

AGGAAGGATGTGATGTCTAAGAACGTCTCTATTTACAAGTCCCAGGCjTTCTGCCCTTGAA 

5lfeJ3Ai?.Gm?GTGATGTCTAAilAACGTCTCTATTTACAAGTCCCAGGC}TTCTGCCCT 
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?y:LGCATGCTGCTGCCAACTGCAAGGTTTTGGTT|cTTGCTAACCCAGdAAA'I|ACCAATGCA| 
AAGCATGCTGCTGCCAACTGCAAGGTTTTGGTT^TTGCTAACCCAGaAAAjm 

aagcatgctgctgccaactgcaaggStttggttgj|,tgctaacccaScB|aacaccaatgca^ 

aagcatgctgctgccaactgcaaggttttggtT'^ttgctaacccagoaaa^Baccaat^ 

aagcatgctgctgccaactgcaaggttttggttpttgctaacccaggaaaj|!accaatgca 

AAGCATGCTGCTGCCAACTGCT^GGTTTTGGTTlTTGCTAACCCAGqAAAlTACCAATGCA 



AAGCATGCTGCTGCCAACTGCAAGGTTTTGGTTGTTGCTAACCCAGGAAACACCAATGCA 
AAGCATGCTGCTGCCAACTGCAAGGTTTTGGTTGTTGCTAACCCAGqH^CACCAATGCA 
AAGCATGCTGCTGCCAACTGCAAGGTTTTGGTTGTTGCTAACCCAGGAAACACCAATGCA 

aagcatgctgctgccaactgcaaggttttgg tt'gttgctaacccagoaaajsaccaatgca 

aagcatggtgctgccaactggaaggttttggQ2E^BSEB^SBD9BS&SI^^BI^B 

aagcatgctgctgccaactgcaaggttttggttgttgctaacccagoaaacaccaatgca 

AAGCATGCTGCTGCCAACTGCAAGGTTTTGGTTGTTGCTAACCCAGQAAACACCAATGCA 

aagcatgctgctgccaactgcaaggntttgggtittgctaacccasoaaagaccaatgca 
aagcatgctgctgccaactgcaaggttttggttgttgctaacccagc|Haacaccaatgca 
aagcatgctg ctgccaactgcaaggttttggttvgttgctaacccagqaaagaccaatgc^ 

aa^gatngtgSHBDBDBBBSBB&BSBBBBBBBS^^ 

aagcatgctgctgccaactgcaaggttttggttgttgctaacccagoaaacaccaatgca 

aagcatgctgctgccaactgcaaggttttgg.tt§ttgct7^cccagqaaa'gaccaatgca 
aagcatgctgctgccaactgcaaggttttggttgttgctaacccaggaaacaccaatgca 
aagcatgctgctgccaactgcaaggttttggttgttgctaacccaggaaacaccaatgca' 
aagcatgctgctgccaactgcaaggttttggttgttgctaacccaggaaacaccaatgca 
aagcatgctgctgccaactgcaaggttttggttsttgctaacccaggaaagaccaatgca 
aaggatgctgctgccaactgcaaggttttggttgttgctaacccaggaaacaccaatgca 
aagcasgctgctgccaactgcaaggttttggttgttgctaacccagcpuuigaccaatgca; 
aagcatgctgctgccaactgcaaggttttggttgttgctaacccaggaj^acaccaatgc/^ 
aagcatgctgctgcgaactgcaaggttttggttgttgctaacccagcaaacaccaatgca 
aagcatgctgctgccaactgcaaggttttggttgttgctaacccagcaaacaccaatgca 
aagcatgctgctgccaactgcaaggttttggttgttgctaagccaggaaacaccaatgca 




TrMDHbl 
TrMDHb2 
TrMDHb3 
TrMBHb4 
TrMDHbS 
TrMDHbS 
TrMDHb? 
TrMDHbS 
TrMDHbS 
TrMDHbl 0 
TrMDHbll 
TrMDHbl2 
TrMDHbl3 
TrMDHbl 4 
TrMDHbl 5 
TrMDHbl6 
TrMDHbl7 
TrMDHblS 
TrMDHbl 9 
TrMDHb20 
TrMDHb21 
jTrMDHb22 
TrMDHb23 
T^MDHb24 
,TrMDHb25 
TrMDHb26 
TrMDHb2 7 
TrMDHb28 
TrMDHb29 
TrMDHbS 0 
TrMDHb31 
TrMDHb32 



TTGATCTTGAAGGA;GTTTGCTCCATCTATTCCAGAGAAAAACATTTCp;gTTTGACTAGA 

ttgatcttgaagga:gtttgct ccatctattccagagaaaaacattt^ 

ttgatcttggaggaatgngctqbbbbxqsdibxdlsb^^ 

ttgatcttgaaggaStttgctccatctattccagagaaaaacatttc|:g|tttgactaga 

TTGATCTTGAAGGAbTTTGCTCCATCTATTCCAGAGAAAAACATTTC|G.GTTTGACTAGA 
TTGATCTTGAAGGAGTTTGCTCCATCTATTCCAGAgAAAAACAT^^ 

SflS^^^XBEDBBEBSEEBSBBSnSSSBBBBBBBBBB 

TTGATCTTGAAGGAATTTGCTCCATCTATTCCAGAGAAAAACATTTCTTGTTJJGACTAGA 
TTGATCTTGAAGGAATTTGCTCCATCTATTCCAGAGAAAAACATTTCTTGTTTGACTAGA 
TTGATCTTG7U\GGAATTTGCTCCATCTATTCCAGAGAAAAACATTTCTTGTTTGACTAGA 
TTGATCTTGAAGGA§TTTGCTCCATCTATT CCAGAGAAAAACATTTC-^GG:TTTGACTAGA 

TTGATCTTGAAGGAATTTGCTCCATCTATTCCAGAGAAAAACATTTCTTGTTTGACTAGA 
TTGATCTTGAAGGAATTTGCTCCATCTATTCCAGAGAAAAACATTTCTTGTTTGACTAGA 

TTGATCTTGAAGGApTTTGCTCCATCTATTCCAGASfAAAAACATTTCpgTTTGj^BB 

TTGATCTTG7iJN.GGAATTTGCTCCATCTATTCCAGAGAAAAACATTTCTTGTTTGACTAGA 

TTGATCTTGAAGGAgTTTGCTCCATCTATTCCA GAGAAAAACATTTCjgGGTTTGACTAGA 

TTGATCTTGAAGGAATTTGCTCCATCTATTCCAGAGAAAAACATTTCTTGTTTGACTAGA 
TTGATCTTGAAGGAlTTTGCTCCATCTATTCCAGAGAAAAACATTTC'kgiTTTGACTAGA 
TTGATCTTGAAGGAATTTGCTCCATCTATTCCAGAGAAAAACATTTCpGTTTGACTAGA 
TTGATCTTGAAGGAATTTGCTCCATCTATTCCAGAGAAAAACATTTCTTGTTTGACTAGA 
TTGATCTTGAAGGAATTTGCTCCATCTATTCCAGAGAAAAACATTTCTTGTTTGACTAGA 
TTGATCTTGAAGGAgTTTGCTCCATCTATTCCAGAGAAAAACATTTC'A^TTTGACTAGA 
TTGATCTTGAAGGAATTTGCTCCATCTATTCCAGAGAAAAACATTTCTTGTTTGACTAGA 
TTGATCTTGAAGGA@TTTGCTCCATCTATTCCAGAGAAAAACATTTC^gTTTGACTAGA 
TTGATCTTGAAGGAATTTGCTCCATCTATTCCAGAGAAAAACATTTCTTGTTTGACTAGA 
TTGATCTTGAAGGAATTTGCTCCATCTATTCCAGAGAAAAACATTTCpGTTTGACTAGA 
TTGATCTTGAAGGAATTTGCTCCATCTATTCCAGAGAAAAACATTTCTTGTTTGACTAGA 
TTGATCTTGAAGGAATTTGCTCCATCTATTCCAGAGAAAAACATTTCjTTGTTTGACTAGA 



574 
571 
531 
564 
565 
564 

558 
559 
561 
555 

559 
558 
550 
553 
550 

543 
539 
530 
529 
526 
523 
517 
393 
345 
342 
184 
184 




TrMDHbl 
TrMDHba 
TrMDHb3 
TrMDHb4 
T-rMDHbS 
TrMDHbS • 
TrMDHb? 
TrMDHbB 
TrMDHbS 
TxMDHblO 
TrMDHbll 
TrMDHbl2 
TrMDHbl 3 
TrMDHbl4 
TrMDHblS 
TrMDHbie 
TrMDHbl? 
TrMDHblS 
TrMDHbl 9 
TrMDHb20 
TrMDHb21 
]T>rMDHb22 
TrMDHb23 
TrMDHb24 
TrMDHb25 
TrMDHb26 
TrMDHb27 
TrMDHb28 
TrMDHb29 
TrMDHb30 
TrMDHb31 
TrMDHb32 



620 



640 



660 



CTTGATCACAi^ 
CTTGATCACAACAGGGCATTGG 



CTTGATCACi 

cttgatcacaacagggcattgggccaaatttctgaaag 
cttgatcacaacagggcattgggccaaatttct' 



CTTGATCAC 
CTTGATCACig 

cttgatcacaacagggcattgggccaaattt 
cttgatcacaacagggcattgggccaaatt 



CTTGATCACAACAG: 
CTTGATCACAACAGGGCATTGGGCCAAATTTCTGAAAGATTGAAT 



CTTGATCACAACAGGGCATTGGGCCAAATTTCTGAAAG 

cttgatcacaacagggcattgI 



cttgatcacaacagggcattgggccaaatttctgaaagattgI 

CTTGATCACAACAGGGCATTGGGCCAAATTTCTGAAAGATTGAATigTfrCAAGTTTCTGATi 

cttgatcacaacagggcattgggccaaatttctgaaag' 
cttgatcacaacagggcattgggccaaatI 

CpTGATCACAACAGGGCATTGGGCCAAATTTCTGTVAAGATTGAATGTTCAAGTTTCTGAT 

cttgatcacaacagggcattgggccaaatttctgaaagattgaatKttcaagtttc tgat 

CTTGATCACAACAGGGCATTGGGCCAAATTTCTGAAAGATTGAATGT|TCAAGTTTC 
CTTGATCACAACAGGGCATTGGGCCAAATTTCTGAAAGATTGAATSTiTCAAGTTTCTGAT 
CTTGATCACAACAGGGCATTGNGCCAAATTTCTGAAAGATTGAATGTj|CAAGTTTCTGAT 
CTTGATCACAACAGGGCATTGGGCCAAATTTCTGAAAGATTGT^TGT^GCAAGTTTCTGAT 

cttgatcacaacagggcattgggccaaatttctgaaagattgaatgttcaagtttctgat 
cttgatcacaacagggcattgggccaaatttctgaaagattgaatgt,tcaagtttctgat 




TrMDHbl 
TrMDHb2 
TrMDHb3 
TrMDHb4 
TrMDHbS 
TrMDHb6 
TrMDHb? 
TrMDHbS 
TrMDHb 9 
TrMDHbl 0 
TrMDHbll 
TrMDHbl2 
TrMDHbl3 
TrMDHbl4 
TrMDHblS 
TrMDHbie 
TrMDHbl? 
TrMDHbie 
TrMDHblS 
TrMDHb20 
TrMDHb21 
^TrMDHb22 
TrMDHb23 
TrMDHb24 
TrMDHb25 
TrMDHb26 
TrMDHb27 
TrMDHb28 
TrMDHb29 
TrMDHbS 0 
TrMDHbS 1 
TrMDHbS 2 



680 



700 



720 



GTAAAGAATGT 



610 



646 



513 
465 
462 
304 
304 



^GAATGTCATTATCTGGG GTAATCATTCATCAACTCAGTATC C|ri 



'GATGTCAACCAT 



GTAAAGAATGTCATTATCTGGGGTAATCATTCATCAACTCAGTATCCtrGATGT^ACCMr 
GTAAAGAATGTCATTATCTGGHGTAATCATTCATCAACTCAGGATCCfTGATGTCAACCAT 
GTAAAGAATGTCATTATCTGGGGTAATCATTCATCAACTCAGTATCCITGATGTCAACCAT 
GTAAAGAATGTCATTATCTGGGGTAATCATTCATCAACTCAGTATCCfTGATGTCAACCAT 
gTAAAGAATGTCATTATCTGGGGTAATCATTCATCAACTCAGTATCCITGATGTCAACCAT 




TrMDHbl 
TrMDHb2 
TrMDHb3 
TrMDHb4 
TrMDHbS 
TrMDHbe 
TrMDHb? 
TrMDHbS 
TrMDHbS 
TrMDHblO 
TrMDHbll 
TrMDHbl2 
TrMDia>13 
TrMDHb 14 
TrMDHblS 
TrMDHbl 6 
TrMDHbl? 
TrMDHbl 8 
TrMDHbl 9 
TrMDHb20 
TrMDHb21 
■jTrMDHb22 
TrMDHb23 
TrMDHb24 
TrMDHb25 
TrMDHb26 
TrMDHb27 
TrMDHb28 
TrMDHb29 
TrMDHbS 0 
TrMDHbS 1 
TrMDHbS 2 



740 



760 



780 



GCAACTGTTT^ACACCCCCGCTGGGGAGAAGCCTGTCCGTGAGCTTGTTTCTfiATnarnpr 



706 



562 
515 
522 
364 
364 



GCAACTGTTAACACCCCCGCgGGGGAGAAGCCTGTCCGTGAiAjCTTGTlrTB^^^^^^H 

gcaactgttaacacccncgctgnhgagaagcctgSccgtgagctS^ 

GCAACTGTTAACACCCCCGCTGGGGAGAAGCCTGTCCGTGAGCTTGTTTCTGAT^ 

gcaactgttaacacccccgctggggagaagcctgtccgtgagcttgtttctgatgacgcc 

GCAACTGTTAACACCCCCGCTGGGGAGAAGCCTGTCCGTGAGCTTGTTTCTGATGACGCC 




TrMDHbl 
TrMDHb2 
TrMDHbS 
TrMDHb4 
TrMDHbS 
TrMDHbS 
TrMDHb? 
TrMDHbS 
TrMDHbS 
TrMDHb 10 
TrMDHbia 
TrMDHbl2 
TrMDHbia 
TrMDHb 14 
TrMDHb 15 
TrMDHbl 6 
TrMDHbl? 
TrMDHbia 
TrMDHblS 
TrMDHb20 
TrMDHb21 
jTrMDHb22 
TrMDHb23 
TrMDHb24 
TrMDHb25 
TrMDHb26 
TrMDHb27 
TrMDHb28 
TrMDHb29 
TrMDHb30 
TrMDHb31 
TrMDHbS 2 



800 



820 



840 



TGGTTGAATGGAGAATTCATAT CTACCGTTCAACAflrr.TnriTnr 



TGGTTGAATGGAGAATTCATATCTACCGTTCAACAACGTGGTGCTGGAATTATTAAGGCT 
TGGTTGAATGGAGAATTCATATCTACCGTTCAACAACGTGGTGCTGdAATTATTAXGGCT 
TGGTTGAATGGAGAATTCATATCTACCGTTCAACAACGTGGTGCTGdAA^^ 



752 



582 
424 
424 



AGAAAGCTTTCAAGCGCACTATCCGCTGCTAGCGCTGCTTGCGAg^S^^ggn.^:^^^ 




TrMDHbl 
TrMDHb2 
TrMDHba 
TrMDHb4 
trMDHbS 
TrMDHbS 
TrMDHb? 
TrMDHbB 
TrMDHbS 
TrMDHblO 
TrMDHbll 
TrMDHbl2 
TrMDHbia 
TrMDHb 14 
TrMDHblS 
TrMDHbl6 
TrMDHbl? 
TrMDHblS 
TrMDHblS 
•TrMDHb20 
TrMDHb21 
I TrMDHb22 
TrMDHb23 
TrMDHb24 
TrMDHb25 
TrMDHb26 
TrMDHb27 
TrMDHb28 
TrMDHb29 
TrMDHb30 
TrMDHb31 
TrMDHb32 




920 



940 



TrMDHbl 
. TrMDHb2 
TrMDHbS 
TrMDHb4 
TrMDHbS 
TrMDHbe 
TrMDHb7 
TrMDHbS 
TrMDHb9 
TrMDHbl 0 
TrMDHbl 1 
TrMDHbl2 
TrMDHbl3 
TrMDHbl4 
TrMDHblS 
TrMDHbie 
TrMDHbl? 
TrMDHbie 
TrMDHbl9 
TrMDHb20 
TrMDHb21 
)TrMDHb22 
TrMDHb23 
TrMDHb24 
TrMDHb25 
TrMDHb26 
TrMDHb27 
TrMDHb28 
TrMDHb29 
TrMDHbS q 
TrMDHb31 
TrMDHb32 



960 



GTTCTTGGAACTCCCCAGGGCACCTTCGTTTCT^TGGGAGTGTATTGTGATGGTTCTTAC 
GTTCTTGGAACTCCCCAGGGCACCTTCGTTTCAATGGGAGTGTATTGTGATGGTTCTTAC 



TrMDHbl 
TrMDHb2 
TrMDHba 
TrMDHb4 
TrMDHbS 
TrMDHbe 
TrMDHb? 
TrMDHbe 
TrMDHbS 
TrMDHblO 
TrMDHbll 
TrMDHbl2 
TrMDHbl3 
• TrMDHbl4 
TrMDHbXB 
TrMDHbie 
TrMDHbl? 
TrMDHbl 8 
TrMDHbl9 
TrMDHb20 
TrMDHb21 
j TrMDHb22 
TrMDHb23 
TrMDHb24 
TrMDHb25 
TrMDHb26 
TrMDHb27 
TrMDHb28 
TrMDHb29 
TrMDHb30 
TrMDHb31 
TrMDHb32 



980 



1000 



1020 



aacgtaccagctggactcatctattcattccctgtcaccactgctaAtggggaatggaaH 

ftACGTACCAGCTGGACTCATCTATTCATTCCCTGTCACCACTGCTAATGGGGAATGGaaa 



603 
604 



) 



TrMDHbl 
TrMDHb2 
TrMDHbS 
TrMDHb4 
TrMDHbS 
TrMDHbe 
TrMDHb? 
TrMDHbS 
TrMDHbS 

• TrMDHbl 0 
TrMDHbl 1 
TrMDHbl2 
TrMDHbl3 
TrMDHbl4 
TrMDHblB 
TrMDHbl 6 
TrMDHbl? 
TrMDHbl 8 
TrMDHbl 9 
TrMDHb20 
TrMDHb21 

)TrMDHb22 
TrMDHb23 
TrMDHb24 
TrMDHb25 
TrMDHb26 
TrMDHb27 
TrMDHb28 
TrMDHb29 
TrMDHbS 0 
TrMDHbS 1 
TrMDHb32 



1040 



1060 



1080 



^TTGTTCAAGGACTTTCAATTGACGAGTTCTCAAGGAAGAAGTTGGACTTRACAnrTnAa 



664 



I 



TrMDHbl 
TrMDHb2 
TrMDHb3 . 
TrMDHb4 
TrMDHbS 
TrMDHbe 
TrMDHb? 
TrMDHbB 
TrMDHb9 
TrMDHbl 0 
TrMDHbll 
TrMDHbl2 
*TrMDHbl3 
TrMDHbl4 
TrMDHblS 
TrMDHbie 
TrMDHblV 
TrMDHblS 
TrMDHbl 9 
TrMDHb20 
TrMDHb21 
tTrMDHb22 
TrMDHb23 
TrMDHb24 
TrMDHb25 
TrMDHb26 
TrMDHb27 
TrMDHb28 
TrMDHb29 
TrMDHb30 
TrMDHbS 1 
TrMDHbB 2 



* 1100 



GAGTTATCCGAGGAAAAGAGTTTGGCATACT 



695 



Tgure 61 Consensus contig nucleotide sequence of TrMDHc 

* 20 ' * . 40 ' . * 60 
TrMDHc : i^GNGAATTGOAATATACGACaVCTCCATTCCATACTTC 

* -80 . * 100 * 120 
TrMDHc : " GCTCTCTCTCTCTTTATTCTCGAAAAGCTTTTTCAGCCAACAACGGAGAQAATT 

* * 140 • * 160 * * 180. 

TrMDHc : CCGTCGATGCTCAGATCCGTCCAATCAGCCGTCTCCCGCGCCTCTTCTCACCTAACCCGC 

* 200 * 220 * 240 
TrMDHc : CGTGGCTATGCTACCGAACCAGTTCCAGAACGaVAGGTGGCaVTTCTCGG 

* 260 * 280 * 300 
TrMDHc : GGGATCGGCG?^CCTCTCTCTCTTCT<»TGAAGCTCA^ 

) • . 

* 320 * 340 * 360 

TrMDHc : CTTTATGATATTGCTGGAACCCCTGGTGTCGCCGCTGATGTCAGCCACATCAACTCCAGA 

* 380 * 400 * 420 
TrMDHc : TCTGAGGTAACTGGGTATGC:aVGGTGA?U3AAGAGCTTGGAAAAGCm 

* 440 * 460 * 480 
TrMDHc : GTTGTTATAATTCCTGCTGGTGTGCCCAGAAAGCCTGGAATGACTCGTGATGATCTTT^ 

* * 500 * 520 * 540 
TrMDHc : AATAOTAAaSCTGGO^TTGTCAAGTCACTTGCCACTGCTATTTCTAAGTACTGCCCCC^^ 

* 560 * 580 * '600 
TrMDHc : GCCCTTGTTAACATGATAAGCAACCCTGTGAACTCCACCGTTCCCA 

* 620 * 640 * 660 
TrMDHc : TTCAAGAAGGCAGGGACATATGACGAGAAGAGATTGTTTGGGGTTAGA^ 

* 680 ♦ 700 * 720 
TrMDHc : GTC^IGGGCAAAAACTTTCTATGCCGGGAAAGCTAAAGTTCCT^GTTGCCGAGGTCAATG 

* 740 * 760 * 780 
TrMDHc : CCTGJTTATAGGAGGCCATGCa^GGAGTTACTATTCTTCCATTATTTTOT 

* 800 * 820 * * 840 
TrMDHc : CAAGCO^TCTGGGTGATGATACCCTTAAGGNTTTAACGGNANG^ 

* 860 * 880 * 900 
TrMDHc : ACAGAAGTTGNGACCGCCaUlGGCIGGAAAGGGTTCTGCAACTTTGTC^ 



) 



TrMDHc 



* 920 * 940 * 960 

GQAGCCATATTTGCTGATGCTNGCCrOWVAGGNCTGAATGGAGTTCCAGAT^ 



* * 1^00 * 1020 
TrMDHc : TGCTCATATQTGCAATCCaATATCATCTCTGACCTTCCTTO . io20 

* * lOSO * 1080 
TrMDHc : ATTGGGAAGAATGGTGTGGAAGAAATTCTGGGCTTAGGTTCTCTCA(^GATTTCGA : 1080 



1100 * . 1120. * 1140 

: 1140 



TrMDHc : CAAGGCCTTGAAAACCTOUVGGCTXSAACTCAAATCATCTATTGAAAAGGGA^ 



* 1160 * 1180 * 1200 
TrMDHc : GCCTCCCy^GTAATCGAAOVTGTCATACATTACTGGATTTOTCCATTTAGAACC^ : 1200 

« 1«T^« * ^^^^ * 1240 • * 1260 

TrMDHc : ATTTTGCAAATTCAGAACAATTGTTTGTAATGTTGCCGGTAGGTATACCCCTAGATTTAA i 1260 

* 1280 * 1300 * 1320 
^TrMDHc : TAAGTAAATCTGCGAGAGCAGTTTATTGCTGCAGGGACTGAAATTAAAA^^ : 1320 

* 1340 * 1360 * 1380 
TrMDHc : TTGGCCTTTCCmTTCGTAATGGCCCTTaiTTGTTaca^TGNTTTC^^ : 1330 

* 1400 

TrMDHc : GGGTGNTGGNCANC6ATACACANCCCCC : 1408 



gure 62 Deduced amino acid sequence of TrMDHc 



* 20 • * 40 ♦ 60 
TrMDHc : ^^«>Sly^;JRSVQSAVSRASSHLTRRQYATEPVPERKVAILGAAGGIGQP^^ : 60 

* -80 ' ** 100 * 120 
TrMDHc : IiSLYDIAGTPGVAADVSHINSRSEVTGYAGEEBLGKMiBGADWIIPAGVPROT : 120 

* 140 * 160 * 180 
TrMDHc : LPNINAGIVKSLATAISKyCPHALVWMISNPVNSTVPIAAEVFKKAG^ : 180 

* 200 * 220 * 240 
TrMDHc I DVVRAKTFYAGKAKNnPVAEVNVPVIGGHASVTIIiPliPXQATPQAN^ : 240 

* 260 * 280 * 300 
TrMDHc : GGTEVXTAKAGKGSATIiSMAYAGAIFADAXLKXLNGVPDVIECSyVQSNlISDljPFFASK : 300 

i 

* 320 * 340 

TrMDHc : VRIGKNGVEEILGLGSLTDFEQQGLENLKAELKSSIEKGIKFASQ : 345 



i 



Figure 63 Nucleotide sequences of nucleic acid fragments contributing to the 
consensus contig sequence TrMDHc 



20 



40 



TrMDHc 1 
TrMDHc 2 * 
TrMDHc 3 
TrMDHc4 
TrMDHcS 
TrMDHc 6 
TrMDHc? 
TrMDHcS 
TrMDHcS 
TrMDHclO 
TrMDHc 11 
TrMDHc 12 
TrMDHc 13 
TrMDHc 14 
TrMDHclS 
TrMDHc 16 
TrMDHc 17 



60 



) 



TrMDHcl 

TrMDHc2 

TrMDHc3 

'rrMDHc4 

TrMDHcS 

TrMDHc 6 

TrMDHc? 

TrMDHcS 

TrMDHcS 

TrMDHclO 

TrMDHc 11 

TrMDHc 12 

TrMDHcl3 

TrMDHcl4 

TrMDHclS 

TrMDHc 16 

TrMDHcl? 



cSf^TcgAHeS 



— ^ 1---^ err T-ic3 iy """^ — — 

TCC.gTgCgP|^T1§CTTTpWjTTTOTCG^ 

|ACTHcCATTCCNTACTTTN|TTl5lTN|gCg 

■"^ClTCCHTTCCNTJACTTTHOTTg^^^ 

d|TCCAT|cCNTACTTTliNrTT|^jTCGCT 

§lT|cgTTCCgTACTTTHCiTTC||5cg 

■T§CCATTCCNTACTTTW^TT^TNr^C^ 

TCCATTCCNTJACT^TjETTT^^TCGCT 



80 



100 



120 



CTCTCTCTCTTTATTCTCGAAAAGCTTgTTCAGCCAACAACGBAGAdAAT?;^iATGAGGCCGTCG 

ctctctctBtBtattctcgaaaagctttttcagccaacaacgBagagaataIatgaggccgtcg 

CTCTCTCTgTTTATTCTCGAAAAGCTTTTTCAGCCAlCAACGGAGAGAATTATGAGgcCGTCG 
CTCTCTCTCTlTATTCTCGAAAAGCTTTTTCAGCCgACAACGpAGAGAATgATGAGGCCGTCG 
CTCTCTCjHTTTATTCTCGAAAAGCTTTTTCAGCCAACAACGGAGAGAATTATGAGGCCGTCG 

ctctctcqBtttattctcgaaaagctttttBagccaacaacggagagaattatgaggccgtcg 

CTCTCTCTCTTTATTCTCGAAAAGCTTTTTCAGCCAACAACGGAGAGAATTATGAGGCCGTCG 

ctctctctctttattctcgaaaagctttttcagccaacaacgHagagaatISatgaggccgtcg 

CTCTCTCT|T|TATTCTCGAAAAGCTTTTTHl3GCCAiCAACGGAGAGAATTATGAGGCCGTCG 

gccaacaacggagagaattatgaggccgtcg 

G C cHjjfAAC G G AG AG AATT ATG AGG C CG TCG 
TTCTCgAAAAGCTTTTTBAGCclACAACGHAGAGAAB|ATGAGGCCGTCG 
TTCTCGHAAAGCTTTTTCAGCCBACAACGNAHAdAATiATGAGGCCGTCG 



60 
59 
42 
30 
27 
27 
27 
27 
25 
25 



122 
119 
105 
91 
88 
87 
90 
89 
87 
86 
48 
46 
48 



140 



TrMDHcl 
TrMDHc 2 
TrMDHc 3 
TrMDHc4 
TrMDHcS 
TrMDHc6 
TrMDHc? 
TrMDHcS 
TrMDHc 9 
TrMDHclO 
TrMDHc 11 
TrMDHc 12 
TrMDHc 13 
TrMDHc 14 
TrMDHclS 
TrMDHc 16 
TrMDHcl? 



160 



180 



ATGCTCAGATCCGTCCAATCAGCCGTlTCCCGCGCCTC&CTCACCiAACCCGCCGTGGCTAT 

atgctcagatccgtgcaatcagccgt:§tcccgcgcctc|tctcacctaacccgccgtggctat 

ATGCTCAGATCCGTCCAATCAGCCGTCTCCCGCGCCTCTTCTCACCTAACCCGCCGTGGCTAT 

atgctcagatc|gtccaatcagccgt|tcccgcgcctc|tctcacc'i?aacccgccgtggctat: 

ATGfTjTCAGATCCGTCCAATCAGCCGTCTCCCGCGCCTCTTCTCACCTAACCCGCCGTGGCTAT 
ATGjdjTCAGATCCGTCCAATCAGCCGTCTCCCGCGCCTCTTCTCACCTAACCCGCCGTGGCTAT; 
ATGgTCAGATCCGTCCAATCAGCCGTCTCCCGCGCCTCTTCTCACCTAACCCGCCGTGGCTAT 

atgctcagatccgtccaatcagccgt-ItcccgcgcctcIgtctcaccJaacccgccgtggctat 
atg|tcagatccgtccaatcagccgtctcccgcgcctcttctcacctaacccgccgtggctat 
atgjtjtcagatccgtgcaatcagccgtctcccgcgcctcttctcaccofaacccgccgtggctat 
atggtcagatccgtcgaatcagccgtctcccgcgcctcttctcacctaacccgccgtggctat 

ATGCTCAGATC;fpTCcBATCAGCCGT|TCCCGCGCCTC@TCTCACCTAACCCGCCGTGGGTAT 
ATGCTCAGA TCgjGTCCAATCAGCCGTj^TCCCG'GGCCTCGTGTCACnTPAACrGGCCGTnG&'TAT 



185 
182 
168 
154 
151 
150 
153 
152 
150 
149 
111 
108 
111 



200 



220 



240 



TrMDHcl 

TrMDHc2 
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TrMDHclS 

TrMDHc 16 

TrMDHcl? 



GCTACCGAACCAGTTCCAGAACGCAAGGTGGCCATTCTCGGj^GCTGGCGGCGGGATCGG-A^CAG 
GCTACCGAACCAGTTCCAGAACGCAAGGTGGCCATTCTCGGgGCTGGCGGCGGGATCGG^^CAG 
GCTACCGAACCAGTTCCAGAACGCAAGGTGGCCATTCTCGGCGCTGGCGGCGGGATCGGCCAG 
GCTACCGAACCAGTTCCAGAACGCAAGGTGGCCATTCTCGGgGCTGGCGGCGGGATCGGgCAG 
GCTACCGAACCAGTTCCAGAACGCT^AGGTGGCCATTCTCGGCGCTGGCGGCGGGATCGGCCAG 
pCTACCGAACCAGTTCCAGAACGCAAGGTGGCCATTCTCGGCGCTGGCGGCGGGATCGGCCAG 
GCTACCGAACCAGTTCCAGAACGCAAGGTGGCCATTCTCGGCGCTGGCGGCGGGATCGGCCAG 
GCTACCGAACCAGTTCCAGAACGCAgGGTGGCCATTCTCGGgGCTGGjJGGCGGGATCGGSCAG 
GCTACCGAACCAGTTCCAGAACGCAAGGTGGCCATTCTCGGCGCTGGCGGCGGGATCGGCCAG 
GCTACCGAACCAGTTCCAGAACGCAAGGgGGCCATTCTCGGCGCTGGCGGCGGGATCGGCCAG 
GCTACCGAACCAGTTCCAGAACGCAAGGTGGCCATTCTCGGCGCTGcicGGCGGGATCGGCCAG 
GCTACCGAACCAGTTCCAGAACGCAAGGTGGCCATTCTCGGCGCTGGgjGGCGGGATCGGCCAG 
GCTACCGAACCAGTTCCAGAACGCAAGGWGGCCATTCTCGGSiGCTGGCGGCGGGATCGGlScAG 
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TrMDHclS 
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CCTCTCTCTCTTCTCATGAAGCTCAACCCTCTCGTTTCAACCCTATGTCTTTATGATATTGCT 
CCTCTCTCTCTTCTCATGAAGCTCAACCCTCTCGTTTCAACCCTATGTCTTTATGATATTGCT 
CCTCTCTCTCTTCTCATGAAGCTCAACCCTCTCGTTTCAACCCTATGTCTTTATGATATTGCT 
CCTCTCTCTCTTCTCATGAAGCTCAACCCTCTCGTTTCAACCCTATGTCTTTATGATATTGCT 
CCTCTCTCTCTTCTCATGAAGCTCAACCCTCTCGTTTCAACCCTATGTCTTTATGATATTGCT 
CCTCTCTCTCTTCTCATGAAGCTCAACCCTCTCGTTTCAACCCTATGTCTTTATGATATTGCT 
CCTCTCTCTCTTCTCATGAAGCTCAACCCTCTCGTTTCAACCCTATGTCTTTATGATATTGCT 
CCTCTCTCTCTTCTCATGAAGCTCAACCCTCTCGTTTCAACCCTATGTCTTTATGATATTGCT 
CCTCTCTCTCTTCTCATGAAGCTCAACCCTCTCGTTTCAACCCTATGTCTTTATGATATTGCT 
CCTCTCTCTCTTCTCATGAAGCTCAACCCTCTCGTTTCAACCCTATGTCTTTATGATATTGCT 
CCTCTCTCTCTTCTCATGAAGCTCAACCCTCTCGTTTCAACCCTATGTCTTTATGATATTGCT 
CCTCTCTCTCTTCTCATGAAGCTCAAgcCTCTCGTTTCAACCCTATCrTCTTTATGATATTGCT 
CCTCTCTCTCTTCTCATGAAGCTCAACCCTCTCGTTTCAACCCTATGTCTTTATGATATTGCT 
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TrMDHclS 

TrMDHcl6 

TrMDHcl? 



360 



ggaacccctggtgtcgccgctgatgtcagccacatcaactccagatgtgaggtaactgggtat 
ggaacccctggtgtcgccgctgatgtcagccacatcaactccagatgtgaggtaactgggtat 
ggaacccctggtgtcgccgctgatgtcagccacatcaactccagatgtgaggtaactgggtatI 
ggaacccctggtgtcgccgctgatgtcagccacatcaactccagatgtgaggtaactgggtat 
ggaacccctggtgtcgccgctgatgtcagccacatcaactccagatgtgaggtaactgggtat 

GGAACCCCTGGTGTCGCCGCTGATGTCAGGCACATCAACTCCAGATGTGAGGTAACTGGGTAT, 

ggaacccctggtgtcgccgctgatgtcagccacatcaactccagatgtgaggtaactgggtatI 

GGAACCCCTGGTGTCGCCGCTGATGTCAGCCACATCAACTCCAGATGTGAGGTAACTGGGTAT 

ggaacccctggtgtcgccgctgatgtcagccacatcaactccagatgtgaggtaactgggtatJ 

GGAACCCCTGGTGTCGCCGCTGATGTCAGCCACATCAACTCCAGATGTGAGGTAACTGGGTATj 

ggaacccctggtgtcgccgctgatgtcagccagatcaactccagatgtgaggtaactgggtatj 
ggaacccctggtgtcgccgctgatgtcagccacatcaactccagatgtgaggtaactgggtat 
ggaacccctggtgtcgccgctgatgtcagccacatcaactccagatgtgaggtaactgggtati 
^gtgtcgcc gctg|Pgtcagccacatcaactcca1a1gtgaHgtaactgggtat 
gStg atgtIng c cHg atHaactc cHg at gtg ag gtaactggg tat! 
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•frMDHcia 
TrMDHc 14 
TrMDHclB 
* TrMDHcie 
TrMDHcl? 



GCAGGTGAAGAAGAGCTTGGAAAAGCTTTGGAGGGTGCTGATGTTGTTATAATTCCTGCTGGT 
GCAGGTGAAGAAGAGCTTGGAAAAGCTTTGGAGGGTGCTGATGTTGTTATAATTCCTGCTGGT 
GCAGGTGAAGAAGAGCTTGGAAAAGCTTTGGAGGGTGCTGATGTTG'IfTATAATTCCTGC^GGT 
GCAGGTGAAGAAGAGCTTGGAAAAGCTTTGGAGGGTGCTGATGTTGOfTATAATTCCTGCTGGT 
GCAGGTGAAGAAGAGCTTGGAAAAGCTTTGGAGGGTGCTGATGTTGOj'TATAATTCCTGC^GGT 
GCAGGTGAAGAAGAGCTTGGAAAAGCTTTGGAGGGTGCTGATGTTGTj'TATAATTCCTGCjGGGT 
GCAGGTGAAGAAGAGCTTGGAATVAGCTTTGGAGGGTGCTGATGTTGTTATAATTCCTGC^^GGT 
GCAGGTGAAGAAGAGCTTGGAAAAGCTTTGGAGGGTGCTGATGTTGTTATAATTCCTGCTGGT 
GCAGGTGAAGAAGAGCTTGGAAAAGCTTTGGAGGGTGCTGATGTTGTTATAATTCCTGCpGGT 
GCAGGTGAAGAAGAGCTTGGAAAAGCTTTGGAGGGTGCTGATGTTGaj*TATAATTCCTGCj&jGGT 
GCAGGTGAAGAAGAGCTTGGAAAAGCTTTGGAGGGTGCTGATGTTGTTTATAATTCCTGCgGGT 
GCAGGTGAAGAAGAGCTTGGAAAAGCTTTGGAGGGTGCTGATGTTGTTTATAATTCCTGCTGGT 
GCAGGTGAAGAAGAGCTTGGAAAAGCTTTGGAGGGTGCTGATGTTGTTATAATTCCTGCTGGT 
GCAGGTGAAGAAGAGCTTGGAAAAGCTTTGGAGGGTGCTGATGTTG1FTATAATTCCTGCTGGT 
GCAGGTGAAGAAGAGCTTGGAAAAGCTTTGGAGGGTGCTGATGTTG'li'TATAATTCCTGCTGGT 
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TrMDHcl 
^ TrMDHc2 
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TrMDHclB 
TrBOHcie 
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gtgcccagaaagcctggaatgactcgtgatgatcttttcaatattaAcgctggcattgtcaag 
gtgcccagaaagcctggaatgactcgtgatgatcttttcaatatta^cgctggcattgtcaag 
gtgcccagaaagcctggaatgactcgtgatgatcttttjpaatattajI^gctggcattgtcaag 
gtgcccagaaagcctggaatgactcgtgatgatcttttcaatatta^cgctggcattgtcaag 
gtgcccagaaagcctggaatgactcgtgatgatcttttcaatattatIcgctggcattgtcaag 
gtgcccagaaagcctggaatgactcgtgatgatcttttcaatatta^lcgctggcattgtcaag 
gtgcccagaaagcctggaatgactcgtgatgatcttttcaatatta;icgctggcattgtcaag 
gtgcccagaaagcctggaatgactcgtgatgatcttttcaatatta^cgctggcattgtcaag 
gtgcccagaaagcctggaatgactcgtgatgatcttttcaatattaticgctggcattgtgaag 
gtgcccagaaagcctggaatgactcgtgatgatcttttcaatatta^cgctggcattgtcaag 
gtgcccagaaagcctggaatgactcgtgatgatcttttcaatatttuicgctggcattgtcaag 
gtgcccagaaagcctggaatgactcgtgatgatcttttcaatattajIcgctggcattgtcaag 
gtgcccagaaagcctggaatgactcgtgatgatcttttcaatatta^cgctggcattgtcaag 
gtgcccagaaagcctggaatgactcgtgatgatcttttcaatatta;{cgctggcattgtcaag 
gtgcccagaaagcctggaatgactcgtgatgatcttttcaatatta^icgctggcattgtcaag 
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TrMDHcl 
1 TrMDHc2 
TrMDHcS 
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TrMDHcS 
TrMDHc? 
TrMDHcS . 
TrMDHc9 
TrMDHclO 
TrMDHcll 
TrMDHcl2 
TrMDHclB 
trMDHcl4 
TrMDHc is 
TrMDHcie 
TrMDHcl? 



TCACTTGCCACTGCTATTTCTAAGTACTGCCCCCATq 
TCACTTGCCACTGCTATTTCTAAGTACTGCCCCCATGCCCTTGTTAiiCATGATAAGCAACCCT 
TCACTTGCCACTGCTATTTCTAAGTACTGCCCCCATGCCCTTGTTAjicATGATAAGCAACCCT 
TCACTTGCCACTGCTATTTCTAAGTACTGCCCCCATGCCCTTGTTAidcATGATAAGCAACCCTj 
TCACTTGCCACTGCTATTTCTAAGTACTGCCCCCATGCCCTTGTTAiicATGATAAGCAACCCT 
TCACTTGCCACTGCTATTTCTAAGTACTGCCCCCATGCCCTTGTTA^icATGATAAGCAACCCT 
TCACTTGCCACTGCTATTTCTAAGTACTGCCCCCATGCCCTTGTTAiicATGATAAGCAACCCT 

tcacttgccactgctatttctaagtactgcccccatgcccttgttaiicatgataagcaaccct 
tcacttgccactgctatttctaagtactgcccccatgcccttgtta^icatgataagcaaccct 
tcacttgccactggtatttctaagtactgcccccatgcccttgtta^icatgataagcaaccct 
tcacttgccactgctatttctaagtactgcccccatgcccttgttaicatgataagcaaccct' 
tcacttgccactgctatttctaagtactgcccccatgcccttgttapIcatgataagcaaccct 
tcacttgccactgctatttctaagtactgcccccatgcccttgttailcatgataagcaaccct 
tcacttgccactgctatttctaagtactgcccccatgcccttgtta;{catgataagcaaccct 
tcacttgccactgctatttctaagtactgcccccatgcccttgttatIcatgataagc aaccct 
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TrMDHcl 

TrMDHc2 

TrMDHc3 
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TrMDHc? 

TrMDHcS 

TrMDHc 9 
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TrMDHc 12 

TrMDHc 13 . 

TrMDHcl4 

TrMDHcXB 

TrMDHcl6 

TrMDHcl? 



GTGAACTCCACCGTTCCCATTGCTGCAGAGGTTTTCAAGAAG GCAGGG 
GTGAACTCCACCGTTCCCATTGCTGCAGg! 
GTGAACTCCACCGTTCCCATTGCTGCAGAGG 

GTGAACTCCACCGTTCCCATTGCTGCAGAGGTTTTCAAGAAGGCAGGGACATAT 
GTGAACTCCACCGTTCCCATTGCTGCAGAGGTTTTCAAGAAGGCAGGGACATATGACGAGAAG 
GTGAACTCCACCGTTCCCATTGCTGCAGAGGTTTTCAAGAA GGCAGGGACATATGAGGAGAAG 

GTGAACTCCACCGTTCCCATTGCTGC] 

GTG?^CTCCACCGTTCCCATTGCTGCAGAGGTTTTCAAGAAGGCAGGGACATATGACGAGAAG 
GTGAACTCCACCGTTCCCATTGCTGlAGAGGTTTTCAAGAAGGCaGGGACATATGAC§AGAA| 
GTGAACTCCACCGTTCCCATTGCTGCAGAGGTTTTCAAGAAGGCAGGGACATATGACGAGAAG 
GTGAACTCCACCGTTCCCATTGCTGCAGAGGTTTTCAAGTiLAGGCAGGGACATATGACGAGAAG 
GTGAACTCCACCGTTCCCATTGCTGCAKAGGTTTTCAAGAAGGCAGGGACATATGAC'NAGAAG 
GTGAACTCCACCGTTCCCATTGCTGCAGAGGTTTTCAAGAAGGCAGGGACATATGACGAGAAG 
GTGAACTCCACCGTTCCCATTGCTGCAGAGGTTTTCAAGAAGGCAGGGACATATGACGAGAAG 
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AGATTGTj 

AGATTnTTTGGGGTTACAACCCTTGATGTAGTCAGGGCgAAAACTTt^^tTATGCCGGGAAAGCT 



AGATTGTTTGGGGTTACAACCCTTGATGTAGTCAGGGC:G.AAAACTT'l!'CTATGCCGGGAAAGCTj 
Ai^TTGTTTGGGGTTHCAACCCTTG ATGTAGTCAGGGpGAAAACTTTgTjgTGCCGGGAAA^ 
AGATTGTTTGGGGTTACAACCCTTG 

AGATTGTTTGGGGTTACAACCCTTGATGTAGTCAGGGCAAAAACT 
AGATTGTTTGGGGTTACAACCCTTGATGTAGNCAGGGCAAAAACTTTjgTATGC'^GGGAAAGCT 
AGATTGTTTGGGGTTACAACCCTTGATGTAGTCAGGGCAAAAACTTTCTATGC||GGGAAAGCT 
AGATTGTTTGGGGTTACAACCCTTGATGTAGTCAGGGCAAAAACTTTfCTATGC^GGGAAAGCT 
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AAAGTTCCAGTTGCCGAGGTCAATGTACCTGTTgjTgGGAGGCCATGGAGGAGTTACTATTixITT 



AAAGTTCCAGTTGCCGAGGTCAATGTAC 











NGAG 




TWCTATT 



AAAGTTCCAGTTGCCGAGGgCAATGgACCTGTTATAGGAGGCCATGdAGGAGTTACTATTCT| 

aaagttccagttgccgaggtcaatgtacctgttataggaggccatggaggagttactattct.| 
aaagttccagttgccgaggtcaatgtacctgttataggaggccatggaggag ttactatt ctIg 
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82 



CCATTATTTTNT^AGGHAACACCTNAAGCCAATNTGGigiTGATGAlJAC^^^ 



CClQTTSTTTTgaTiiAGGBSAl^ 



T.NGGWGATWA5a!A 



782 



769 



CCATTATTTTWTjSAGGCAACACCTWAAGCCAATgTGGGTG^ 

CCATTATTTT^|TCAGGCAACACCTCAAGCCAATCTGGSTGATGATAcb||TTAAG 

CCATTATTTTfiTCAGGCAACACCTCAAGCCAATCTGGiTGATGATAclcj^TTAAGci^^ 



TrMDHcl 
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GNANGG!gc)^JCAAGATGGGGGAAClGAAHTTGNGACCGCCAAGG!CT^ 



W|^GpCN,CAA2^1JGBGGGAJ\ 



g^Janggac^caa^anggaggaaca^aaSi^ 

gsa^ggacacaagatggaggaacagaagttg^gaccgccaaggctggjaaagggttctgc^ 

GgA|;GGACACAAGATGGAG GAACAGAAGTTGfcACCGCCAAGGCTGGAAAGGGTTCTGCAACT 



827. 



801 



801 
558 
'545 



900 



TrMDHcl 
^ TrMDHc2 
TrMDHcS 
TrMDHc4 
TrMDHcS 
TrMDHc6 
TrMDHc7 
TrMDHc8 
TrMDHcS 
- TrMDHclO 
TrMDHcll 
TrMDHcl2 
TrMDHcl 3 
TrMDHcl4 
TrMDHclS 
TrMDHcl€ 
TrMDHcl7 



920 



940 



TTQNN:AATGG^ 

TTGTCAATGGCTTATGCTGGAGCCATATTTGCTGATGCTgGCCTCAAhGGfflCTGAATGGAGTT 



TTGTCAATGGCTi 



ctgwtgctngcctBnaSggwctgaatggagtt 

IgIgiSgtt 
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CCAGATGTTATTGAGTGCTCATATGTGCAATCCAATATCATCTCTGA'CCTTNCTTTCTTTGCT 



JMilMiMMi i ii l iffi 
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TCCAAGGTGAGGATTGGGAAi'7AATGGTGTGGG[A5WAAT 



7Z2 



TCCAAGGNWgGGATTGGGAAGAATGGTGTGGAAGAgATTCTC 

TCCjSAGGTGAGGATTGGGAAGAATGGTGTGGAAGAAATTCTGGGCTTlAGGTTCTCTCACAGATi 
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TTCGAGCAACAAGGCCTTGAAAACCTCAAGGCTGAACTCAAATCATGTATTGAAAAGGG7VATC 
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^^AATTTGCCTCCCAGTAATCGAACATGTCATACATTACTGGATTTT'irCCATTTAGAACCAGAT 
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[TGCAAATTCAGAACAATTGTTTGTAATGTTGCCGGTAGGTATACCCCTAGATTTAA 
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Itaagtaaatctgcgagagcagtttattgctgcagggactgat^ttaaaaccagttttaggttgI 
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GCCTTTCCATTCGTAATGGCCCTTCATTGTTGCATGiSITTTCATATA^^TGCAATTGAAGGGTGN 
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TGGNCANCGATACACANCCCCC 
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Figure 64 Consensus contig nucleotide sequence .of TrMDHd 



* 20 * 40 * 60 
TrMDHd : GGGTAGGCGGAGATTTNAACCCATTTTCCTCTTiUVATCTCTCTC^ : 60 

* 80 * 100 * 120 
TrMDHd : CCCaiTTACCATTCATTCCCaVGAGGTCGJWSATGGCAGaVTCAGCAGC^ : 120 

* 140 * 160 * 180 
TrMDHd : TTGGAACTGCCCAAACAGGGAGGCC:».CTTCCTCAATCAAACCCTTTTGQTTTCA^ ; 180 

* * 200 * 220 * 240 
TrMDHd s ATTCCCAGGTTAATTTTAAGACCTTCTCTGGTCTCAAGGCCATG^ : 240 



* 260 * 280 * • 300 

TrMDHd : AGTCTGAATCSVTCTTTCTTTGGCAACGAAACTAGTGCTGCTCTGC^^ : 300 



' * 320 * 340 * 360 

•TrMDHd : CCAAAGCTCyUUU\GGAAAACCAAAACa^TCAACCGaUVTT^ : 360 



* 380 * 400 * 420 

TfrMDHd : AAGTGGCGGTTCTTGGTGCTGCAGGAGGAATTGGTCAGCCAC^^^ : 420 



* 440 * 460 * 480 

TrMDHd : TGTCGCCnTTGGTTTCCX3ACCTGCATCTTTATGATATCGCGJ^ : 480 



* 500 * 520 * 540 

TrMDHd : CTGATATCAGTCATTGCAACACrCCTTCAAAGGTTTTGGATTTCACAGGTGCT^ : 540 



* 560 * 580 * 

TrMDHd : TGGaUVATTGTTTGAAAGGTGTGGATGTAGTTGTTATACCmK:TGGTGTTC : 5 98 



Figure 65 Deduced amino acid sequence of TrMDhd 

* 20 * 40 * 60 
TrMDHd : MAASIUATFTIGTAQTGRPLPQSNPFGLKWSQVNFKTFSGLKA^ : 60 

* 80 * 100 * 120 
TrMDHd : ' TSAALRATFAPKAQKBNQNINimLHPQASYKVAVIiGAAGGIGQPIiALLI^ : 120 



★ 140 * 160 

TrMDHd : 'yDIMJVKGVAADISHCNTPSKNOiDFTGASELAM J 169 




Figure 66 Nucleotide seq[uence8 of nucleic acid fragments contributing to the 
consensus contig sec[uence TrMDHd 



bgGTAGGCGGAGATTTNAACCCATTTTCCTCTTAAATCTCTCTWAACT^ 

B'GgjTAGG CGGAGATTgNAACCCATTTTCCTCTTAAATCTCTCTCBACTiTCTCTTTCCATT 

BBHBBBgGGAGATTTNAACCCATTTTCCTCTTAAATCTCTcBc^^CTiTCTCjTTCCATT 



TrMDHdl 
TrMDHd2 
TrMDHda 
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120 



CCCATTACCATTCATTCCCAGAGGTCGAGATGGCAGCATCAGCAGCAGjCTACTTTTACTA 
CCCATTACCATTCATTCCCAGAGGTCGAGATGGCAGCATCAGCAGCAG'CTACTTTTACTA 
CCCATTACCATTCATTCCCAGAgGTilGAGATGGCAGCATCAGCAGCAG.CTACTTTTACTA 



120 
118 
112 



TrMDHdl 
TrMDHd2 
TrMDHd3 



140 



160 



180 



TTGGAACTGCCCAAACAGGGAGGCCACTTCCTCAATCAAACCCTTTTGGTTTGAAAGTCA 
TTGGAACTGCCCAAACAGGGAGGCCACTTCCTCAATCAAACCCTTTTG|GTTTGAAAGTCA 
TTGGAACTGCCCAAACAGGGAGGiScACTTCCTCAATCAAACCCTTTTGGTTTGAAAGTCA 



200 



220 



240 



TrMDHdl 
TrMDHd2 
TrMDHd3 



ATTCCCAGGTTAATTTTAAGACCTTCTCTGGTCTCAAGGCCATGTCATjCTCTAAGATGCG 
ATTCCCAGGTTAATTTTAAGACCTTCTCTGGTCTCAAGGCCATGTCAIjCTCTAAGATGCG 
ATTCCCAGGTTAATTTTAAGACCTTCTCTGGTCTCAAGGCCATGTC^TjCTCTAAGATGCG 



240 
238 
232 



TrMDHdl 
TrMDHd2 
TrMDHd3 



260 



280 



300 



agtctgaatcatctttctttggcaacgaaactagtgctgctctgcgtgcaacttttgcac 
agtctgaatcatctttctttggcaacgaaactagtgctgctctgcgtgcaacttttgcac 
agtctgaatcatctttctttggcaacgaaactSigtgctgctctgcgtgcaacttttgcac 



TrMDHdl 
TrMDHd2 
TrMDHd3 



320 



340 



360 



CCAAAGCTCAAAAGGAAAACCAAAACATCAACCGCAATTTGCATCCTdAGGCATCCTACA 
CCAAAGCTCAAAAGGAAAACCAAAACATCAACCGCAATTTGCATCCxdAGGCATCCTACA 
CCAAAGCTCAAAAGGAAAACCgAAACATCAACCGCAATTTGCAlcCTQAGGCATCCTACA 



360 
358 
352 



TrMDHdl 
TrMDHd2 
trMDHd3 



380 



400 



420 



AAGTGGCGGTTCTTGGTGCTGCAGGAGGAATTGGTCAGCCACTGGCAGTTCTCATTAAG 
AAGTGGCGGTTCTTGGTGCTGCAGGAGGAATTGGTCAGCCACTGGCAGTTCTCATTAAGA 
AAGTGGCGGTTCTSGGTGCTGCAGGAGGAATTGGTCAGCCACTffiGCAGTTCTCATTAAGA 



420 
418 
412 



440 



460 



480 



TrMDHdl 
TrMDHd2 
TrMDHd3 



TGTCGCCTTTGGTTTCCGACCTGCATCTTTATGATATCGCGAATGTTAAGGGAGTTGCTG 
TGTCGCCTTTGGTTTCCGACCTGCATCTTTATGATATCGCGAATGTTiAGGGAGTTGCTG 
TGTCGCCTTTGGTTTCCGACCTGCATCTTTATGAiSATSGCGAATGTT^iAGGGAGTTGCTG 



500 



520 



540 



TrMDHdl 
TrMDHd2 
TrMDHd3 



TrMDHdl 
TrMDHd2 
TrMDHd3 



CTGATATCAGTCATTGCAACACTCCTTCAAAGGTTTTGGATTTCACAGGTGCTTCTGAGT 
CTGATATCAGTCATTGCAACACTCCTTCAAAGGTTTTGGATTTCACAGGTGCTTCTGAGT 
CTGATATCAGjCATTGCAACACTCCTTCAAAGGTTTTGGATTTCACAGGTGCTTCTGAGg 



TGGCAAATTGTTTG|^^B3BHB^^^SBDBSBBBB^B^C^BBDf]^&^B 

TGGCAAATTGTTTGAAAGGTGTGGATGTAGTTGTTATACCTGCTGGTGU 1 CLLfit^HS 
TjgGCAAATTGTTTGAAAGGTGTGGATGTaGTTGTTATACCTGCTGGTGTTCCFqAGAAJ^ 




Figure 67 Consensus contig nucleotide sequence of TrMDHe 

* 20 * 40 . * 60 

TrMDHe : TTNTNTTTATTTTATGTTTTTTNCCTCCT s ^0 • 



* 80 * 100 * 120 

TrMDHe : TCTCrdAATTATTATTAGTCCTTAGAAATGGAAGC^CATGC^^ s 120 



* 140 * 160 * 180 
TrMDHe : ATTGCa^GAATCTCTGCTCATCTTO^CCTCCAAA^ - 180 

* 200 * 220 * 240 
TrMDHe.: ATTAGCAAAGCTAACTGCAGAGCAAAAGGTGGGGCGCaK3GATTCaVAAGTAGC^ : 240 

* 260 * 280 * 300 
TrMDHe : GGGGCTGCTGGTGGAATTGGTCAATCCCTTTCTTTGCTGTTGAAGATCaATCCATTGGT^ : 300 

\ 

* 320 * • 340 . * 360 
TrMDHe : TCAGTTCTTCATCTTTATQATGTTGTO^aiCTCCTGGTGTCACTGCTGATGTO : 360 

* . 380 * 400 * 420 
TrMDHe ": ATTGACACCGGTGCTGTGGTTCGTGGCTTTCTAGGGCa^GGCACAACTTaAGAATGCA 420 

* 440 * 460 * 480 
TrMDHe : ACAGGCa^TGGACTTGGTCGTTATACCTGCTGGTGTGCCGAGGAAACCrGGAATGA : 480 



* 500 520 . * 540 

TrMDHe : GATGACTTATTTAAGATAAATGCTGGAATTGTGAGGACrCOTAGCQAAGG : 540 



* 560 * 580 * * 600 
TrMDHe : AGCTQTCCTAATGCyUlTTGTCAACTTGATTAGCAATCa^GTGAAra s 600 

* 620 * 640 * • 660 
TrMDHe : GCTGCTGAGGTTTTCAAGAAAGCCGGTACATATGATCCAAAGCGACTTTTAGGGGTTA^ : 660 

* 680 * 700 * . 720 
TrMDHe : ACCCTOGATGTTGTGAGGQO^TACCTTTGTGGCAGAAGTACTTGGTGTTGATCC^ : 720 



* 740 * 760 * 780 

TrMDHe : GAGGTTGATGTTCCAGTGGTAGGAGGGCACGCAGGAGTCSICAATATTACCTCTTI^ : 780 



* 800 * 820 * 840 
TrMDHe : aVGGTTAAGCCTCCCAGTAGCTTCACCGCAGAAGAiUVCCGAATACCTGAa^^ = 840 

* 860 * 880 * 900 
TrMDHe : CAAAANGGCGGAAOlCAAGTTGTTGAGGCaUUVGGCTGGGGCTGGTTCGGC^ s 900 



* 920 • * 940 * 960 

TrMDHe : ATGQCCTATGCAGCTGCCAAGTTTGCTAACGCATGCCTCCGTGGCTTG^ : 960 




* 980 * 1000 * 1020 

rMDHe : GGGATAGTGQAGTGTGCTTTTGTTGATTCTCAGGTTACGGiU^C^ : 1020 



* 1040 * 1060 * 1080 
CrMDHe : AAGGTTCGTCTTGGTCGCGGTCGAGCyVGAAGAGATATATO^ACTTGGTCTCCra : 1080 

* 1100 * 1120 * 1140 
TrMDHe : TATGAGAGGATTGGATrAGAAAAAGCXSAAGAAAGAGTTAGCAGGAAGCATCCT^ : 1140 



* 1160 * 1180 * 1200 

TrMDHe : GTAGAATTCATCAAAA2iAAAANAAAGATAAGGAAAAATTAGrn^ s 1200 



* 1220 * 

TtMDHe : ATATCTATAAAGAACTTGT6TAATAATTCC : 1230 



68 Deduced amino acid sequence of TrMDHe 



igure 

* 20 * 40 * 60 
TrMDHe : MBAHAAGi^QRIi^lSAHLQPPNFQBGGDVAISKANCRAKGGAPGFKVAILGAAGGIGQS : 60 

* 80 * * "° 
trMDHe ! I.SI.LI.IaNPLVSVLHLyDVVOTPG^^?ADVSHIDTQAVVRGPIKK^AQLBNALTGra)^^^ : 120 

140 * • 160 * 180 

TrMDHe : AGVPWCPGMTRDDLFKIHAGIWTLSBGIAKSCPraOCVNLISNPVNSTVPIAAEVPKKAQ s 180 

« 

* 200 • * 220 . . * . 240 
TrMDHe : TYDPKMJ.GVTTLDVVRANTFVAEVLGVDPREVDVP • 240 

* 260 * 280 • * 3ti0 
TrMDHe : ABBTBXLTXRIQXGGTQVVEAKAGAGSATLMAYAAAKFANACLRGLKGEAQIVECAFVDS : 300 

* 320 * 340 * 



TrMDHe : QVTEl.PFFAAKVM,GRGGABEiyQIiGPUIEYERIGI.EKAKKBIAGSIQROVBFIKKKXR : 



359 




Figure .69 Nucleotide sequences of nucleic acid fragments contributing to the 
consensus contig sequence TrMDHe • 



[ttntntttattttatgttttttncctcctacatataactcttmacti|ngcatacact ;ct 



TrMDHel 
TrMDHe2 
TrMDHe3 
TrMDHe4 
•TrMDHeS 
l?rMDHe6 
TrMDHe? 
TrMDHeS 
TrMDHeS 
TrMDHelO 



80 



TrMDHel 
TrMDHe2 
TrMDHe 3 
TrMDHe 4 
TrMDHeS 
TrMDHe 6 
TrMDHe? 
TrMDHeS 
TrMDHeS 
TrMDHelO 



100 



120 



TCTCTBAATTATTATTAGTCCTTgGAAATGGAAGCACATGCAGCTGdtrgCCAATCAGAGG 
TCTCTCAATTATTATTAGTCCTTAGAAATGGAAGCACATGCAGCTGGgGCCAATCAGAGG 
TCTCTCAATTATTATTO 

*"'*^^™*'*'™''™^ 7agtcct ta||aaatggaagcacatgcagctggagccHatcBgagg 

■■■■^^GAGAAATGGAAGCACATGCAGCTGGAGCCAATCAGAGG 

F—'rTifr- rr«. rr3 r--T.-*- 

Cg:A|s;TGCAGCTGqp:;GCCMCTIvllv^^^ 



140 



TrMDHel 
TrMDHe2 
TrMDHeS 
TrMDHe4 
TrMDHeS 
TrMDHe 6 
TrMDHe? 
TrMDHeS 
TrMDHeS 
TrMDHelO 



160 



180 



ATTGCAAGAATCTCTGCTCATCTTCAgCCTCCAAATTTCCAGGAAGdAGGTGATGTTGCA 
ATTGCAAGAATCTCTGCTCATCTTCAACCTCCAAATTTCCAGGAAGdAGGTGATGTTGCA 
ATTGCAAGAATCTCTGCTCATCTTCAACCTCCAAATTTCCAGGAAGGAGGTGATG.TTGCA 
ATTGCAAGAATCTCTGCTCATCTTCHicCTCCAAJVTTTCCAGGAJ^GGASGTGATGT.f'G^^ 
ATTGCAAGAATCTCTGCTCATCTTCAACCTCCAAATTTCCAGGAAGGAGGTGATGTTGCA 
ATTGCHAGAATCTCTGCTCATCTTBKrACCTCCBAATTTCCAGGAAGdAGGTGATGTTGCA 



200 



TrMDHel 
TrMDHe2 
TrMDHe3 
trMDHe4 
TrMDHeS 
TrMDHe 6 
TrMDHe? 
TrMDHeS 
TrMDHeS 
TrMDHelO 



220 



240 



attagcaaagctaactgcagagcaaaaggtggggcgccgggattcai^iagtagcaatcttg 
attagcaaagctaactgcagagcaaaaggtggggcgccgggattcaaIagtagcaatcttg 
attagcaaagctaactgcagagc|aaaggtggggcgccgggattcaa1agtagcaatcttg 
attagcaaagctaactgcagagcaaaaggtggggcgccgggattca^jagtagcaatcttg 

ATTAGCAAAGCTAACTGCAGAGCAAAAGGTGGGGCGCCGGGATTCAiAGTAGCAATCTTG 
ATTAGCAAAGCTAACTGCAGAGCAAAAGGTGGGGCGCCGGGATTC/dAGTAGCAATCTTG 




TrMDHel 
TrMDHe2 
TrMDHe3 
TrMDHe4 
TrMDHeS 
TrMDHe6 
TrMDHe7 
TrMDHeS 
TrMDHeS 
TrMDHel 0 



trMDHel 

TrMDHe2 

TrMDHeS 

TrMDHe4 

TrMDHeS 

TrMDHe6 

TrMDHe? 

TrMDHeS 

TrMDHeS • 

TrMDHelO 



TrMDHel 
TrMDHe2 
TrMDHeS 
TrMDHe4 

TrMDHeS 
TrMDHee 
TrMDHe? 
TrMDHeS 
TrMDHeS 
TrMDHelO 



TrMDHel 
TrMDHe2 
TrMDHeS 
^ TrMDHe4 
frMDHeS 
trMDHee 
TrMDHe? 
TrMDHeS 
TrMDHeS 
TrMDHelO 



TrMDHel 
TrMDHe2 
TrMDHeS 
TrMDHe4 
TrMDHeS 
TrMDHee 
TrMDHe? 
TrMDHeS 
TrMDHeS 
TrMDHelO 



GGGGCTGCTGGTGGAATTGGTCAATCCCTTTCTTTGCTGTTGAAGATCAATCCATTGGTT 
GGGGCTGCTGGTGGAATTGGTCAATCCCTTTCTTTGCTGTTGAAGATCAATCCATTGGTT 
GGGGCTGCTGGTGGAATTGGTCAATCCCTTTCTTTGCTGTTGAAGATCAATCCATTGGTT 
GGGGCTGCTGGTGGAATTGGTCAATCCCTTTCTTTGCTGTTGAAGATCAATCCATTGGTT 
GGGGCTGCTGGTGGAATTGGTCAATCCCTTTCTTTGCTGTTGAAGATCAATCCATTGGTT 
GGGGCTGCTGGTGGAATTGGTCAATCCCTTTCTTTGCTGTTGAAGATCAATCCATTGGTT 



TCAGTTCTTCATCTTTATGATGTTGTCAACACI^CT^^TeACTCdTGSSSS 

TCAGTTCTTCATCTTTATGATGTTGTCAACACTCCTGGTGTCACTGGTGATGTTAGTCAC 

TCAGTTCTTCATCTTTATGATGTTGTCAACACTCCTGGTGTCACTGGTGATGTTAGTCAC 

TC,^GTTCTTCATCTTTATGATGTTGTCAACACTCCTGGTGTCACTGGTGATGTTAGTCAC 

TCgGTTCTTCATCTTTATGATGTTGTCAACACTCCTGGTGTCACTGGTGATGTTAGTCAC 
TCAGTTCTTCATC 



2SS 
243 
243 
223 
218 
203 



35S 
303 
• 303 
283 
27S 
263 



ATTGACACCGGTGCTGTGGTTCGTGGCTTTCTAGGGCAGGCACAACTTGAGAATGCACTT 
ATTGACACCGGTGCTGTGGTTCGTGGCTTTCTAGGGCAGGCACAACTTGAGAATGCACTT 
ATTGAaiACCGGTGCTGTGGTTCGTGGCTTTCTAGGGCAGGCACAACTTGAGAATGCACTT 
ATTGACACCGGTGCTGTGGTTCGTGGCTTTCTAGGGCAGGCACAACTTGAGAATGCACTT 
ATTGACACCGGTGCTGTGGTTCGTGGCTTTCTAGGGCAGGCACAACTTGAGAATGCACTT 
^^^^^^^^^^^roG^^^^GGGCAGGCACAACljTGAGAATGCACTT 



419 
363 
363 
343 
338 
323 
22 



ACAGGCATGGACTTGGTCGTTATACCTGCTGGTGTGCCGAGGAAACdTGGAATGACAAGG 
ACAGGCATGGACTTGGTCGTTATACCTGCTGGTGTGCCGAGGAAACa'TGGAATGACAAGG 
ACAGGCATGGACTTGGTCGTTATACCTGCTGGTGTGCCGAGGAAAcdTGGAATGACAAGG 
ACAGGCATGGACTTGGgCGTTATACCTGCTGGTGTGCCGAGGAAAcdTGGAATGACAAGG 
ACAGGCATGGACTTGGTCGTTATACCTGCTGGTGTGCCGAGGAAACGTGGAATGACAAGG 
ACAGGCATGGACTTGGTCGTTATACCTGCTGGTGTGCCGAGGAAACGTGGAATGACAAGG 
ACAGGCATGGACTTGGTCGjTATACCTGCTGGTGTGCCGAGGAAACGTGGAATGaraanp-. 



gatgacttatttaagata^'gctgSttgtgagga^^ 

gatgacttatttaagataaatgctggaattgtgaggactcttagcgsIaggaattgccaag 

GATGACTTATTTAAGATAAATGCTGGAATTGTGAGGACTCTT,'SeSGii!AGGAATTGScAAG 
GATGACTTATTTAAGATAAATGCTGGAATTGTGAGGACTCTTTCiEGAlAGGAATTGicAAG 
GATGACTTATTTAAGATAAATGCTGGAATTGTGAGGACTCTT'lllrtei^GGAATTGliCAAG 

gatgacttatttaagataaatgctggaattgtgaggactcttagcga!aggaattgccaag 
gatgacttatttaagataaatgctggaattgtgaggactcttagcgaIaggaattgccaag 



47S 
423 
423 
403 
398 
383 
82 



539 
483 
483 
463 
458 
443 
142 




TrMDHel 

TrMDHe2 

TrMDHe3 

TrMDHe4 

TrMDHeB 

TrMDHee 

TrMDHe? 

TrMDHeS. 

TrMDHe9 

TrMDHelO 



560 



AGCTGTCCTAATGCAATTGTCAACTTGATTAGCAATCCAGTGAATTCtACTGTGCCAATT 
AGCTGTCCTAATGCAATTGTCAACTTGATTAGCAATCCAGTGAATTC'CACTGTGCCAATT 
AGCTGTCCTAATGCAATTGTCAACTTGATTAGCAATCCAGTGAATTC'CACTGTGCCAATT 

agctgtcctaatgcaattgtcaacttgattagcaatccagtgaattccactgtgccaatt! 
agctgtcctaatgcaattgtcaacttgattagcaatccagtgaattc'cactgtgccaatt' 
agctgtcctaatgcaattgtcaacttgattagcaatccagtgaattcIcactgtgccaatt 

AGCTGTCCTA ATGCAATTGTCAACTTGATTAGCAA TCCAGTGAATTr'rarTP,Tnrr.;snn^>r 



599 
543 
543 
523 
5i8 
503 
202 



TrMDHel 
TrMDHe 2 
TrMDHeS 
TrMDHe4 
TrMDHeS 
TrMDHee 
TrMDHe? 
TrMDHeB 
) TrMDHe 9 
TrMDHelO 



620 



GCTGCTGAGGTTTTCAAGAAAGCCGGTACATATGATCCAAAGCGACOlTTTASiGGGT-aACA 

gctgctgaggttttcaagaaagccggtacatH] ^'■■'^ 

gctgctgaggt|ttcaagaaagccggtacatat 

GCTGCTGAGGTdTTCAAGAAAGCCGGTACATATGATCCAAJViaCGACaiTTTAGGA'GTTACA 
GCTGGTGAGGT^TTCAAGAAAGCCGGNACATATGATCCAAASicifACTlTTTAiGGGTTACA 
GCTGCTGAGGTTTTCAAGAAAGCCGGTACATATGATCCAAAGCGACTjTTTAG^^^M 

gctgctgaggttttcaagaaagccggtacatatgat|caaagcgaci)tttaggggt|!aca 

■"TATGATCcHAiGCGACaiTTTAGGBGiTACA 



659 
574 
576 
583 
578 
555 
262 
28 



TrMDHel 
TrMDHe 2 
TrMDHeS 
TrMDHe4 
TrMDHeS 
TrMDHe 6 
TrMDHe? 
TrMDHeS 
TrMDHe 9 
TrMDHelO 



680 



ACCCTCGATGNTGT 



700 



720 



IvfCTTGGNGTTGMjCC.^Ag^ 



ACCCTCGATGTTGTGAGGGCAAATACCTTTGTGGCAGAAGTACTTGdTGTTGATCCAAGA 



ACCCTCGATGTTGTGAGGGCAAATACCTTTG 



GCAGAAGTACTTGGTGTTGATCCAAGA 



673 



593 
637 

322 
88 



TrMDHel 
TrMDHe2 
TrMDHe3 
TrMDHe4 
TrMDHeS 
TrMDHe6 
TrMDHe? 
TrMDHeS 
TrMDHe 9 
TrMDHelO 



740 



760 



780 



tJAGG.a.TgAT; HTTCCAifTGGTAGGAGG GCfe:CGCiMGGawTBar^n 



TTC C AGWGGTAGGAgGG C ACG CAl^G 
GAGGTTGATGTTCCAGTGGTAGGAGGGCAPnpanna r■n^^■7^ tt- CCTCTTTTCTCr. 



693 

381 
148 



TrMDHel 
TrMDHe2 
TrMDHe3 
TrMDHe4 
TrMDHeS 
TrMDHe 6 
TrMDHe? 
TrMDHeS 
rrMDHe9 
TrMDHelO 



800 



820 



840 



CAGGTTAAGCCTiCCAGTASCTTHACCGiAGAA|yu^CCGAATACC*GACAi^AWCGMATT 
£^GGTTAAGCCTCCCAGTAGCTTCAC|I;GCAGAAGAAACCGAATACC'^GACAA^^ 




TrMDHel 
TrMDHe2 
'TrMDHeS 
TrMDHe4 
TrMDHieS 
TrMDHeS 
trMDHe? 
TrMDHeS 
TrMDHe 9 
trMDHelO 



860 



880 



900 




473 
268 
42 
38 



TrMDHel 
TrMDHe2 
TrMDHe3 
TrMDHe4 
TrMDHeS 
TrMDHee 
TrMDHe 7 
TrMDHeS 
^ TrMDHeS 
TrMDHe XO 



920 



940 



960 



ATGGCigTATGCAGCTGCCAAGTTTGCTAACGCATGCCTCCGTGGCTTGAAAGGAGAAGCC 
ATGGCCTATGCAGCTGCCAAGTTTGCTAACGCATGCCTCCGTGGCTTGAAAGGAGAAGCC 
ATOGCgTAT GCAGCTGCC BAGTTTGCTAACGCATGCCTCCGTGGCTTOAAAGGAGAAGCC 



328 
102 
97 



980 



TrMDHel 
TrMDHe2 
TrMDHe3 
TrMDHe4 
TrMDHeS 
TrMDHe 6 
TrMDHe 7 
TrMDHeS 
TrMDHeS 
TrMDHelO 



1000' 



1020 



GGGATAGTGGAGTGTGCTTTTGTTGATTCTCAGGTTACGGAACTTCGTTTCTTTGCAGCC 
GGGATAGTGGAGTGTGCTTTTGTTGATTCTGAGGTTACGGAACTTCGTTTCTTTGCAGCC 
GGGATAGTGGAGTGTGCTTTTGTTGATTCTCAGGTTACGGAACTTCGTTTCTTTnrAnrr- 



388 

162 
157 



TrMDHel 
TrMDHe2 
TrMDHe 3 
TrMDHe4 
TrMDHeS 
TrMDHe 6 
4'rMDHe7 
TrMDHeS 
TrMDHeS 
TrMDHelO 



1040 



1060 



1080 



AAGGTTCGTCTTGGTCGCGGTGGAGCAGAAGAGATATAGCAACTTGGTCCCCTTAATGAG 
AAGGTTCGTCTTGGTCGCGGTGGAGCAGAAGAGATATATCAACTTGGTCCCCTTAATGAG 
AAGGTTCGTCTTGGTCGCGGTGGAGCAGAAGAGATATATCAACTTGGTCCGCTTAATGAG 



1100* 



TrMDHel 
TrMDHe2 
TrMDHeS 
TrMDHe4 
TrMDHeS 
TrMDHe 6 
TrMDHe 7 
TrMDHeS 
TrMDHeS 
TrMDHelO 



1120 



1140 



TATGAGAGGATTGGgTTgGAAAAAGCGAAGAAgGAGTTAGCgGGAAGCATCCAGAAGGGA 
TATGAGAGGATTGGATTAGAAAAAGCGAAGAAAGAGTTAGCAGGAAGCATCCAGAAGGGA 
TATGAGAGGATTGGATTAGAAAAAGCGAAGAAAGAGTTAGCAGGAAGCATCCAGAAGGGA 




1160 



1180 



1200 



TrMDHel 
TrMDHe2 
TrMDHe3 
TrMDHe4 
TrMDHeS 
TrMDHe6 
TrMDHe? 
TrMDHeS 
TrMDHeS 
TrMDHelO 



GTAGAATTCATCAgAAAA|AA|^^AGATAAG^^ 
GTAGAATTCATCA^A^AAAAANAA^^^^E^BEBBBB 

GTAGAATTCATCAAAAAAAAAlNiB^^^^^^^^^^^^^^^^^^^^^^^^^B 



TrMDHel 
TrMDHe2 
TrMDHe 3 
TrMDHe4 
TrMDHeS 
TrMDHe 6 
TrMDHe? 
TrMDHeS 
TrMDHeS 
TrMDHelO 



1220 



ATATCTATAAAGAACTTGTGTAATAATTCC 



598 




Figure 70 Consensus contig nucleotide sequence of TrMDHf 



* 20 . * 40 * 60 

TrMDHf : GNNTAOMGCTATa^ACCCTTCTOTCTTATACAATAATNATAOATAAATO 



60 



* 80 * . 100 * 120 

TrMDHf : TTATGGAGCCAAATTCAGATGCAAATCAACGAATCGCyUU^ 



120 



* 140 * 160 * 180 

TrMDHf : CTCCCAATTTCAAGATGAATGAACATGGTGATTCTTCTTTGACAAGTTTCCATTGCCGTG 



180 



* 200 * 220 * 240 

TrMDHf : CAAAAGGTGGAGCACCTGGATTCaUUlGTTGCAATTTTAGGTGCTGCI^ 



240 



* 260 * 280 * 300 

TrMDHf : AACCTCTTTOUVTGTTGATGAAGATGAATCCTTTGGTTTNAGTTCTTCATCTOT 



300 



TrMDHf 



* 320 * 340 * 360 

TTGTTAATACTCCTGGTGTTACTTCTGATATTAGTCATATGGATACTGCTGCTGTTGTTC 



360 



* 380 * 400 * 420 

TrMDHf, : GAGGGTTTTTGGGGCS^AAATaVQCTTGAGQATGCACTTACSVGGTATGGATTTGGT^ : 420 

* 440 * 460 * 480 
TrMDHf : TTCCTGCCXK5TGTTCCCCGTAAACCTGGAATGACAAGAGATGATCTCTTCAATATAAATG : 480 

* 500 . * 520 * 540 
TrMDHf : CCGGGATCGTTAAAACACTCTGTGAAGCAATTGCAAAGCGATGTCCTAAGGCGATTGTCA : 540 



* * " 560 * 580 * 600 

TrMDHf : AaSTGATTAGTAATCCGGTTAACTCCACTGTCCCO^TTGCGGCT^^ 



600 



* 620 * 640 * * 660 

TrMDHf : CCGGTACTTATQATCCaU^GAGACTTTTGGGAGTGACaVATGCTTaATGTGGTTCGGG 



660 



* 680 * 700 * 720 
TrMDHf : ATACGTTTGTGGCT6AAGTTCTTGGTCTTGATCCAAGGGATGTGGATGTCCCAGTTOTCG : 720 

* 740 * 760 * 780 
TrMDHf : GAGGACATGCCGGAATCACCATTTTACCTCTGCTTTCTCAGGTTA;^ : 780 



* 800 * 820 * 840 

TrMDHf : TO^CGACAAAGGAAATTQAGTACTTGACAGATCGCATACaAAACGGTCOAACr^^ 



840 



♦ 860 
TrMDHf : TTGAGGCCAAAGCTGGAGCTGQCTCT : 866 



Figure 71 Deduced amino acid sequence of TrMDHf 

* 20 * 40 * 60 
TrMDHf : tffiPNSDANQRIARISGHLNPPNFKMNBHGDSSLTSFHCIUUeGGAPGFKVAIU^ : 60 

* 80 ' * loo * X20 
TrMDHf : PLSMLMKMOTLVXViaLYDVVOTPGVTSDISHMDTAAV^ . 120 

* 140 * 160 . * 180 
TrMDHf : PAGVPMCPGMTiyDDLFNINAQIVKTI.CEAIAKRCPKMVNV . leo 

* 200 * 220 * 240 
TrMDHf : GTYDPKRLLGVTMMVVRANTFVAEVLGIiDPRDVDVPWGG^^ • 240 

* 260 

TrMDHf s TTKBIEYLTDRIQNGGTEWEAKAGAGS : 26.8 ' 




Figure 72 Nucleotide sec[uences of nucleic acid fragments contributing to the 
consensus cent ig sequence TrMDHf 



20 



40 



60 



TrMDHf 1 
TrMDHf 2 
TrMDHf 3 



GWNTACMGCTATCNACCCTTCTTTCTTATACAATAATNATAGATAAATi'TCATCTGCTAAA 



60 



80 



100 



120 



TrMDHf 1 
TrMDHf2 
TrMDHfS 



TTATGGAGCCAAATTCAGATGCAAATCAACGAATCGCAAGAATCTCCQGCCACCTAAATC 



120 



140 



160 



180 



TrMDHf 1 
TrMDHf2 
TrMDHf3 



CTCCCAATTTCAAGATGAATGAACATGGTGATTCTTCTTTGACAAGTTTCCATTGCCGTG 



180 



200 



220 



240 



TrMDHf 1 
TrMDHf2 
TrMDHfS 



CAAAAGGTGGAGCACCTGGATTCAAAGTTGCAATTTTAGGTGCTGCTGGTGGCATAGGTC 

GTGNCATAGGTN 



240 
12 



260 



280 



300 



TrMDHfl 
TrMDHf2 
TrMDH£3 



AACCTCTTTCAATGTTGATGAAGATGAATCCgTTGGTTTOAGTTCTTGATCTTTATGATG 

aIcctctttH^atgttgatgaagatgaatcctStggtttqagttcttgatctttatgatg 

tttggtttnhgttctt^hctttatgatg 



299 
70 
29 



320 



340 



360 



TrMDHfl 
TrMDHf2 
TrMDHf3 



ttgttaatactcctggtgttacttctgatatta.gtcagatggatactg||tgctgttgttc 
ttgttaatactcctggtgttacttctgatattagtcatatggatactgctgctgttgttc 
ttgBtaatactcctggtgBtacttctgatattagtHatatggatactgctgctgttgttc 



380 



400 



420 



TrMDHfl 
TrMDHf2 
TrMDHf 3 



gaggStttttggggcaaaatcagcttgaggatgcacttacaggtatggatttggtaatca 
gagggtttttggggcaaaatcagcttgaggatgcacttacaggtatggatttggtaatca 
gagggtttttggggcaaaatcagcttgaggatgcacttacaggtatggatttggtaatca 



419 
190 
146 



440 



460 



480 



TrMDHfl 
TrMDHf2 
TrMDHf 3 



TTCCTGCjlGGTGTTCCCCGTAAACCTGGAATGACAAGAGATGATCTCTTCAATATAAATG 
TTCCTGCCGGTGTTCCCCGTAAACCTGGAATGACAAGAGATGATCTCTTCAATATAAATG 
TTCCTGCCGGTGTTCCCCGTAAACCTGGAATGACAAGAGATGATCTCTTCAATATAAATG 



CCGGGATCGTTAAAACACTCTGTGAAGCAATTGC|iAAGCGATGTCCTiij\GGCGATTGTCA 
CCGGGATCGTTAAAACACTCTGTGAAGCAATTGCAAAGCGATGTCCTAAGGCGATTGTCA 
CCGGGATCGTTAAAACACTCTGTGAAGCAATTGCAAAGCGATGTCCT^AGGCGGTTGTCA 



TrMDHfl : 
trMDHf2 : 
TrMDHf3 : 



560 



580 



600 



TrMDHfl 
TrMDHf2 
TrMDHf3 



ACGTGATTAGTAATCCGGTTAACTCCACTGTCCgQQBB^B^B^^B^^^BBSS 

ACGTGATTAGTAATCCGGTTAACTCCACTGTCCCCATTGCGGCTGAAGTTTTCAAAAGAG 
ACGTGATTAGTAATCCGGTTAACTCCACTGTCCCCATTGCGGCTGAA(?TTTTCAAAAGAG 




'620 



TrMDHfl 
TrMDHf 2 
TrMDHf 3 



TrMDHfl 
?rMDHf2 
TrMDHf 3 



TrMDHfl 
TrMDHf2 
TrMDHf3 



TrMDHfl : 
TrMDHf 2 : 
TrMDHf 3 : 



64 0 



660 



\ 



TrMDHfl 
TrMDHf 2 
TrMDHf3 



CCGGTACTTATGATCCCAAGAGACTTTTGGGAGTGACAATGCTTGATGTGGTTCGGGCCA 
CCGGTACTTATGATCCCAAGAGACTTTTGGGAGTGACAATGCTTGAT^TGGTTCGGGCCA 


* 680 


* 7.00 • * 720 


ATACGTTTGTGGCTGAAGTTCTTGGTCTTGATCCAAGGGATGTGGATGTCCCAGTTGTCG 
ATACGTTTGTGGCTGAAGTTCTTGGTCTTGATCCAAGGGATGTGGATfiTrrranTTnTPrt 


* 740 


* 760 * 780 


GAGGACATGCCGGAATCACCATTTTACCTCTGCTTTCTCAGGTTAAACCACATTCCTCTT 
GAGGACATGCCGGAATCACCATTTTACCTCTGCTTTCTCAGGTTAAAcbcACATTCCTCTT 


• * 800 


* 820 * 840 


TCACGACAAAGGAAATTGAGTACTTG 




TCACGACAAAGGAAATTGAGTACTTG 


ACAGATCGCATACAiQ.AACGGTd3GAACTGAAGT.TGl 


■ * 860 





TTGAGGCCAAAGCTGGAGCTGGCTCT 



430 
386 



490 
446 



550 
506 



576 
566 



552 



Figure 73 Consensus contig nucleotide sequence of TrMDHg 



* . 20 * 40 * 60 

TrMDhg : GTAGGCAGC^TCTiuVCAGCACAATGAACATGGAAA^ : 60' 



* 80 * 100 * 120 

TrMDhg : TACGGTCCTTAAAAAATCTGTTCTTGTTTTATTTTGTACTTTTTTGTTTTGGAAGATCGT : 120 



* 140 *' 160 * 180 

TrMDhg : TAGATACATGTGTGGTCTTCTOUU^GTTGATAAGGAACCAGTOICTGTATTGGTCA : 180 



* 200 . * 220 * 240 

TrMDhg : TGCTGCTUSGACAAATTGGin'ATGCTCTTGNTCCAATQAT^ : 240 



* 260 * 280 * 300 

TrMDhg : CCCAAATCAACCTGGAATTCTTCATATGCTWGATATTGAACCAGGATTAGAGGCCCTTAA : 300 



*' 320 * 340 * .360 

TrMDhg : AGGGGTGAAGATGGAACTGATTGATGGTGCTTTCCC^^CTTCITCAGM • 360 



* 380 * 400 * 420 

TrMDhg : TACGGATGTTGTTGAAGCATGCAAGGATGTTAACATTGCTGTTATGC^^ 420 



* 440 * 460 * 480 

•TrMDhg : AAGGAAGGAAGGAATGGAAAGAAAAGATGTAATGTCTAAGAATGTTTC»-ATTTACAAGGC : 480 



* 500 * 520 * 540 

TrMDhg : TCAAGCTTCAGCTTTGGAGGAGCATGCTGCTGCyiGATTGTAAAGTGCTAGTGGTAGCa^ : 540 



* 560 * 580 * 

TrMDhg : TCCAGCaVAACACAAATGCTCTAATATTGAAAGAATTTGCTCCaiTC^ : 599 




Figure 74 Deduced amino acid sequence of TrMDHg 

* 20 .* 40 * 60 

TrMDhg : MCGLLKVDKBPVTVLVTGAAGQIXYJO^XPMIMeGMML : 60 



* 80 . * 100 * ' 120 . 

TrMDhg : KMELIDGAFPLIiRGWATTDVVEACKDVNIAVMLGGSPRKEGMERKDVMSK^ : 120 



* 140 * 

TrMDhg : SALEBHAAADCK\n;iWi^Aim33^IIiKEFAPSIPEK : 157 . 



Figure 75 Nucleotide sequences of nucleic acid fragments contributino to the 
consensus contig sequence TrMDHg 

* ^^20 * • 40 * fin 

TrMDHg2 : ^^ BSlS^bggBHSOffifigS^ ' 

■ * 80 * 100 * ion 

TrMDHgl : |i|gLc^44^,^,^.|.|Jr,^J.Ai^<b _^ 
TrMDHg2 : BBSSSSSSMM IIIBSS^ ' 

* 140 * * ' iqn 

TrMDHgl : t^iVJcl>j€AilelelifJiiLJJ||[Ui|.|A f c|,j,jekU^^ ' 
TrMDHg2 : BBBSSSBSBSSS^ SB^ ' • 

* 200 * 220 * pAft • 

TrMDHgl : WclefJJAlAlAliLilclclA^i^ALJfJ^rjaLiLi/^t^i i^iiij^^^ 

TrMDHg2 : WSSBSMBSSSSSS S^^ ' 

iM i ft i-ri w min^mwYm^mn^wfii^j^m^mw . 243 

* 260 * 280 * 300 * 

TrMDHgl : MA^MMis^iii^M ik Mi^iiMdMMMS S^S m i« 

TrMDHg2 : l^f8BBs^ §BmB^BM * ^'^^ 

320 * 340 * r»£n * 

TrMDHgl : ^f" C * 

TrMDHg2 : SBSBBEBBagEca aBBiBgiagag^ * 

380 * 400 * 420 * 

TrMDHgl r --^^ _____ _ 

TrMDHg2 : Wekyel^k^iJc iaAlJeiyrAy^^ f i ■ ■ i '"r i " i "."__'l ""."f" * 

• * 460 * 480 * 
TrMDHgl : 

TrMDHg2 : ^dclaklA f ckl Al^^^ ^ n . I I j j7uMn' | T | 'rrr i T l Vr i I r i V i V" '",'""-" '' ' 

* 520 * 540 * * = 

TrMDHgl : ■ 

TrMDHg2 : lefJc<cL ! cf>^ yH4H4nlTltHrK^^ I ■ LL I iT l 'lVj i l l riLLLV i Vi" "f^^^ ' 

60 . * 580 * 
TrMDHgl : ' 




Figure 76 Consensus contig nucleotide sequence of TrMDHh 



* 20 ■ * 40 * 60 • 

TrMDHh s GNNTACNGCTATCNACCCTTCTTTCTTATACAATAATO : 60 



* 80 * 100 * 120 

TrMDHh : TTATGGAGCCAAATTCAGATGCAAATCAACGAATCGCAAGAATCTCCGGCCyV : 120 



* 140 * 160 * 180 

TrMDHh : CTCCCAATTTCM.GATGAATGAACATGGTGATTCTTCTTTGACAA^ : 180 



* 200 ** 220 * 240 

TrMDHh : CAAAAGGTGQAGCACCTGGATTCAAAGTTGCAATTTTAGGTGCTGCTGGTGGCATAGGTC : 240 



* 260 *. 280 ** 300 

TrMDHh : AACCrCTTTCa^TGTTGATGAAGATGAATCCTTTGGTTTNAQ^ : 300 



* 320 * 340 * 360 

* TrMDHh : TTGTTAATACTCCnXK3TGTTACTTCTGATATTAGTCATATGGATACTGCTGCTG : 360 



* 380 * 400 * 420 

TrMDHh : GAGGGTTTTTGGGGCyU^AATCAGCTTGAGGATGCavCTTACAGGTATGGATTTGQTAATCyi : 420 



* 440 * 460 * 480 
TrMDHh : TTCCTGCCGGTGTTCCCCGTAAACCTGGAATGACAAGAGATGATCTCTTOATATAI^^ ; 480 

* 500 * 520 * 540 
TrMDHh : CCGGGATCGTTAAAAO^CTCTGTGAAGCy^TTGCWU^GCGATGTCCTAAQGOT : 540 



* 560 * 580 * 600 
TrMDHh : ACGTGATTAGTAATCCGGTTAACTCCa^CTGTCCCCATTGOSGCTGAAGTTTTCAAAAGAG : 600 

* 620 * 640 * 660 
TrMDHh : CCGGTACTTATGATCCCTU^AGACrrTTGGGAGTGACAATGCTTGATGTGGra : 660 



* 680 * 700 * 720 

TrMDHh : ATACGTTTGTGGCTGAAGTTCTTGGTCTTGATCCAAGGGATGTGGATGTCCCAGTTGTCG : 720 



* 740 *• 760 * 780 

TrMDHh : GAGGACATGCCGGAATCACCATTTTACCTCTGCTTTCTCAGGTTAAACCACATTCCTCTT : 780 



* 800 * 820 * 840 

TrMDHh : TCACGACAAAGGAAATTGAGTACTTGACaiQATCGCATACAAAACGGTGGAAC^^ : 840 



* 860 
TrMDHh : TTGAQGCCAAAGCTGGAQCTGGCTCT ; 866 



•Figure 77 Deduced amino acid sequence, of TrMDHh 

* 20 * ' 40 * 60 

TrMDHh : MEPNSDA^rQRIARXSGHIl]^PNFK^INEHGDSSIJTSFHC^^ : 60 



* 80 * 100 * 120 
TrMDHh : PLSMIiMKMNPIiVXVLHLYDVVNTPGVTSDISHMDTAAVVRGFLGQNQ^ : 12 0 

# 

* . 140 * 160 * 180 
TrMDHh : PAGVPRKPGMTRDDLFNiroVGIVKTLCEAIAKRCPKAIVNVISNPW : 180 



* 200 * 220 * 240 

TrMDHh : GTYDPKRLLGVpiLDVVIUUjrrFVJ^BVLGIiDPra : 240 



* 260 
TrMDHh : TTKEIEYIiTDRIQNGGTEWEAKAGAGS : 268 



Figure 78 Nucleotide sequences of nucleic acid fragments contributing to the 
consensus contlg sec[uence TrMDHh 



* 20 * 40 * 60 

TrMDHh2 : ! 

TrMDHh3 : . ! 



^ 80 •• * 100 - . * 12 0 

TrMDHhl : liilnbMrfclJIrf^tJAlA^^^ . 'j^20 

TrMDHh2 : --.^ ~ „~ „V_ _1 1 „ VI7~IZJSZZZZ~l 1 1__ ] 

TrMDHha : . ! 



* 140 * 160 * ■ 180 

TrMDHh2 : — - — - „ - - „ . - - 1 - - V- „ „ ""7- - 1 ~1 ^S~r"Il"S""^ \ 

TrMDHha : . ! 



) * 200 * 220 * 240 

TrMDHhl : ^^BB^l^^BBB^B^^SB^^^S8^S33BB8B833ISBBSBE38ISCW<fa : 240 

TrMDHh2 : * 3^2 

TrMDHh3 : ! 



* 260 * 28 0 • * 300 

TrMDHhl : tMgblii[ik4AJ^^jdlilijefejl>^^^ : 299 

TrMDHh2 : S^jfiBBBBBnTSMBBiBBiB^ ] 70 

TrMDHh3 : ' -TIlTrTT!! , - „ „\Jffi3S | 29 



* 320 * 340 * 360 

TrMDHhl : lHBiEB^^S^S^SiBSikl4rj|J4iBEB3IBaCT^ : 359 

TrMDHh2 : ^^88188^^^^^85888888^6^68388 ! 13 o 

TrMDHh3 : BH-8B88B88B8BB83888WS8SB88^^ ! 86 



* 380 * 400 * 420 

TrMDHhl : fe^fibfe^abkAAJcfeidgbyj^yj^^^^ : 419 

TrMDHh2 : ^^^^^^^^^^^^33B^B8883Se8i689Si^ i 190 

TrMDHh3 : t^fflM8B88^ffiB8ffiBH88BSHi8^|ppM ! 



* 440 * 460 * . 480 

TrMDHhl : lubbbfeJdbiysbkJski^^ : 479 

TrMDHh2 : ^^^^8i^^^B883S^B888SBB8BB?^^ ! 250 

TrMDHh3 : BBBBBH^88BHgg8B88ffiH98^^ ! 206 



* 500 * 520 * 540 

TrMDHhl : bbfejcfetAJilgk^^ilili^^^ . 539 

TrMDHh2 : ^^^^^^SBSSBloSiWBSBHclS^ i 3x0 

TrMDHh3 ; 88888888888^8888816(8^^ ! 266 



* 560 * 580 . * 600 

TrMDHhl : HgjgyctAAaMe^^fMlJWWctcti^i)^^^^ . 572 

TrMDHh2 : ^8^l^^8iEH9c(cB8iBB88B^ : 370 

TrMDHh3 : 88SS889iS9i8BBBH88iB8 ! 



TrMDHhl 
TrMDHh2 
TrMDHhS 



620 



640 



660 



CCGGTACTTATGATCCCAAGAGACTTTTGGGAGTGACAATGCTTGATGTGGTTCGGGCCA 
CCGGTACTTATGATCCCAAGAGACTTTTGGGAGTGACAATGCTTGATqTGGTTCGGGCCA 



430 
386 



TrMDHhl 
TrMDHh2 
TrMDHh3 



680 



700 



720 



ATACGTTTGTGGCTGAAGTTCTTGGTCTTGATCCAAGGGATGTGGATGTCCCAGTTGTCG 
ATACGTTTGTGGCTGAAGTTCTTGGTCTTGATCCAAGGGATGTGGATdTCCCAGTTGTCG 



490 
446 



TrMDHhl : 
TrMDHh2 : 
TrMDHh3 : 



740 



760 



780 



GAGGACATGCCGGAATCACCATTTTACCTCTGCTTTCTCAGGTTAAAGCACATTCCTCTT 
GAGGACATGCCGGAATCACCATTTTACCTCTGCTTTCTCAGGTTAAAGCACATTCCTCTT 



550 
506 



TrMDHhl 
TrMDHh2 
TrMDHh3 



TrMDHhl 
TrMDHh2 
TrftDHh3 



800 



820 



840 



TCACGACAAAGGAAATTGAGTACTTG _ 

TCACGACAAAGGAAATTGAGTACTTGACAGATCGCATACAAAACGGTdGAACTGAAGTTG 



576 
566 



860 



TTGAGGCCAAAGCTGGAGCTGGCTCT 



592 



Figure 79 Consensus contig nucleotide sequence of TrMDHi 



* 20 • * 40 * 60 
TrMDHi : GNAATCCTCTTTONCTCCCCTACCCTCCmTTTTTCCCT : 60 

* 80 * • 100 * 120 
TrMDHi : TCAACTTTCCACCTCTGAACAAAACTTa^TCTTTTCTC^^ : 120 

* 140 * 160 * 180 
TrMDHi : AACTTCTTCATAAAGTGTTAGGTTTTTTTTTATTACTCTTTTCAAGAACCAC^^ : 180 

* 200 * * 220 * 240 
TrMDHi : T6TTTCTTGAATTCTTTQGAATTTTTT.TTTTCCTGCAACCATGGCCTTGGC7^CACTTA^ : ' 240 

* 260 *' 280 * 300 
TrMDHi : O^CCCCACTTGCTCAAAAACTCAACTTCACTCATCACAACTCTCATTTCTCTCTAGGAC : 3 00 

* 320 * 340 • * 360 
TrMDHi s TCTCCCTAGGOUlTATCACTGTACTTTTGCa^CCACTTCACAGAACTCa^C^ : 360 



* 380 * 400 * 420 

TrMDHi : TACTTGTTCTGTTGCACCAAATO^GTGCAGGCTCCAGCTGTACA^ : 420 



* 440 * 460 * 480 

TrMDHi : GAATAAGCCTGATTGCTATGGTGTCTTCTGCCTTACCTATGATTT6AAGGCTGAAGAGGA : 480 



* 500 * 520 * 540 

TrMDHi : GAaVAAATCCTGQAAGAAATTAATC2\Aa^TTGaAGTCTC^^ : 540 



* 560 * 580 * 600 
TrMDHi : CAATCATCTACTTTTCAAGCTTGCATCTGGTGAAQTTTTTG^ : 600 

J 

* 620 * 640 

TrMDHi : GCTGAAATTATTAGGATCAGAAAGGTCCTTCCAAGCTCTTGAAGGTG : 647 



Figure 80 Deduced amino acid sequence of TrMDHi 

* 20 * 40 * 60 

TrMDHi : P4AIAHLNNPTCSKTOIiHSSQLSFIiSRTLPRQYHCTFAPL^ : 60 



* 80 * 100 * 120 

TrMDHi : VQSQDPKNKPDCTGVPCTbTYDIiKABEETKSWKKLINIAVSGAAGMISNHIjLPK^ : 120 



* 140 
TrMDHi : GPNQPXALKCiXiGSERSFQAXiEG : 142 



Figure 81 Nucleotide sequences of nucleic acid fragments contributing to the 
consensus contig sequence TrMDHi 



[GWAATCCTCTTTGNCTCCCCTACCCTCCTTTTTTTTCCTTCCTTCTTACAl 


B 


CTTCTCTTCT 




,TTCTTA!gACCTTCTCTTAT| 



TrMDHi 1 : 



TrMDHi 1 : 
TrMDHi2 : 



TrMDHil : 
TrMDHi2 : 



TrMDHil : 
TrMDHi 2 : 



TrMDHil 
TrMDHi2 



* 8.0 




* 100 * 120 


icaactttccacctctgaacaaaactt 
Haactttcnacctctgaac©aaaHtti 


CgATCTTTTCTCATTTTCTTATACCCTTTTAGAAA 
HAATCTTTTCTBATTTTCTTATfACCCTTTTACTiAA 


* 140 




* 160 *_ 180 


CTTCTTCATAAAGTGTTAff^^TgHTTTTTT^ 

cttcttcataaagtgtt.gggtttttttttattactcttttcaagaacgacaaaaacagtgt 


* 200 




* 220 • * 240 


TTCTTGAATTCTTTG:gAATTTTTTTTTTCCTGCAACCATGGCCTTGGGACAiGTTAAACAAg 
TTCTTGAATTCHTTGGAAHTTTTTTTTTCCTGCAACCATGGCffiXTGGGACACTTAAACAAC 


* 260 




280 * 300 


cccacttgctcaaaaactcaacttcactcatcacaactctcatttgjtgtctaggactctcc 
cccacttgctcaaaaactcaacttcaStcatcacaSctctcatttctgtctaggactctcc^ 


* 320 


* 


340 * 360 


ctaggcaatatcactgtacttttgcaccacttcacagaactcaacatggcagaattacttg 
ctaggcaatatcactgtacttttgcaccacttcacagaactcaacatggcagaattacttg 



60 
19 



121 
76 



160 
137 



241 
196 



302 
257 



363 
318 



380 



400 



420 



TrMDHil : 
TrMDHi2 : 



ttctgttgcaccaaatcaagtgcaggctccagctgtacaatcacaggatcccaagaataag 
ttctgttgcaccaaatcaagtgcaggctccagctgtacaatcacaggHtcccaagaataag 



424 
379 



440 



460 



480 



TrMDHil : 
TrMDHi2 : 



cctgattgctatggtgtcttctgccttacctatgatttgaaggctgaagaggagacaajvat 

CCTGATTGCTATGGTGTCTTCTGCCTTACCTATGATTTGAAGGCTGAi\GAGGAGACAAAAT 



: 485 
: 440 



500 



520 



540 



TrMDHil : 
TrMDHi2 : 



CCTGGAAGAAATTAATCAACATTGCAGTCTCAGGTGCTGCTGGAATGATTTCCAATCATCTi 
CCTGGAAGAAATTAATCAACATTGCAGTCTCAGGTGCTGCTGGAATGTiTTTCCAATCATCT, 



546 
501 



560 



580 



600 



TrMDHil 
TrMDHi2 



A.CTTTTCAAGCTTGCATCTGGTGAAGTTTTTGGgCCAAATCAACCTATTGCGCTGA^ 
ACTTTTCAAGCTTGCATCTGGTGAAGTTTTTGGCCCAAATCAACCTAlfTGCGCTGAAATTA 



: 602 
: 562 



620 



640 



TrMDHil : 
TrMDHi2 : 



TTAGGATCAGAAAGGTCCTTCCAAGCTCTTGAAGGTG 



: 599 



^re 82 Nucleotide sequence of TrMDHj 



* 20 * • 40 * 60 

TrMDHj : GCAAAGCNCTOlJCNGACCrGGTGTGGAGCGAGCAGCTTTGCTAGACATAAATGGGC^ : 60 



* 80 * 100 * 120 

TrMDHj : TTTTGCGGAGCAGGGAAAAGCTCTAAATGOlGTCGaVTCTCGCAATC : 120 



* • 140 * 160* . * 180 
TrMDHj : TGTGGGAAACCCTTGCAATACWUVTGCaVTTAATATGCTTGAAGAATGCTCCAAATAT^ i 180 

* .200 * 220 * 240 
TrMDHj : TGaUUUVAATTTTGATGCTTTAACCCXSTTTAG^ : 240 



* 260 .* 280 * 300 

TrMDHj : CCTCAAGGCTVGGTQTCTTCTACGATAAAGTGTCGAATATGACGATATGGGGAAACC^ : 300 



* 320 * 340 * 360 
TrMDHj : AACTACTCAGGTCCCCGATTTCTTAAATGCCAGAATCGATGGTTTGCGTGTCAAAGAAGT : 3-60 

* 380 ' * 400 * 420 
TrMDHj : GATTAAGGATCAAAAGTGGTTAGAGGAAGAGTTCACCGAAAAAGTTC^UU^^ : 420 



* 440 * 460 * 480 

TrMDHj : CGTCCTTATTCS^AAAGTOGGGAAGATOSTCTGCTGCATCAACTTC^ : 480 



* 500 * 520 * 540 

TrMDHj : TGCCATACGATCTTTGATCACTCCTACTCCGGAGGGTGATTGGTTTTCTACTGGTGTGTA : 540 



* 560 

TrMDHj : TACAGCTGGAAATCCTTATGGAATAGCTG : 569 



^^lle 83 Deduced amino acid 8eq[uence of TrMDHj 

* 20 * 40 * . 60 
TrMDHj : QSXXXPQVBIUVAIiLDINGQIPAEQOKAIiNAVASHNVKVIWGNPCNTNM : 60 

* 80 * * 100 *' 120 
TrMDHj : AKNFHALTRIiDBKOyVKCQIJU^KAGVFYDKVSl^ : 120 



* 140 * 160 * 180 

TrMDHj : IKDQKWIjEEBFTEKVQKRGGVIiIQKWGRSSAASTSVSIVDAIRSLITPTPEGDWFSTGVY : 180 



TrMDHj : TAGNPYGXA : 189 




Figure 84 Nucleotide sequence o£ TrMDHk 



* 20 • * 40 * * 60 

TrMDHk : GNGTAGAACCCGTGAAGCCTTTTCCCTCCGGTCTCCCCGCTTGCGCCGTCGCCGTCAATT : 60 



* 80 * ' 100 * 120 

TrMDHk : GCTGCTTGTGTCGTCGCCTCCAGCTCCTCCTCCTCC7^CTGTGCCAACCGAATTACa\^ : 120 



* 140 * 160 * 180 

TrMDHk : AAAAAAATGGCGACTTGTTTGaUU^CACiU^CrCCTCCACACAAGACCTTTTCAGTTTCGG : 180 



* 200 * 220 • * 240 

TrMDHk : TCTTCCTOTTCGACAAGACCAACTTCCCTAAGATGTTCCGCCGCCACCCCATCCACC^^ : 240 



* 260 * 280. * 300 

TrMDHk : AAATCCTACAAAATCACTCOTCTTCCGGGTGATGGCATAGGTCCTGAAGTCGTT^ : 300 



* 320 * 340 * 360 

TrMDHk : GCTAAAGACGTTCTTCTCCTCACTGGATCCATCCATGGGATTAAACTTGAGTTTCAAGAG : 360 



* 380 * 400 * 420 

TrMDHk : AAGCTTOTGGGTGGTGCTGCTCTTGATGCT^ : 420 



* 440 * 460 ** 480 

TrMDHk : TCTGTTGCTAAGCAATCTGATQCTGTTCTTCTTGGTGCTATTGGAGGGTATAAAl^^ : 480 



* 500 * 520 * . 540 

TrMDHk : AAAAATGAGAAf^CAGCTGT^GCCAGAAACTGGATTGCTTCAGCTACGAGAAGGGCTTCAA : 540 



TrMDHk : GTTTTTGCTAATCTCAGA : 558 



Pxgure 85 Deduced amino acid sequence of TrMDHk 

* 20 * 40 * 60 

TrMDHk : MATCLQTQIjI.HT^FQPRSSSSTRPTSliRCSAATPSTKKSYKITLIiPGDGIGPEVVSVAK i 60 



80 * 100 



120 



TrMDHk-: DVLLLTGSIHGIKLEFQBKI1I.GGAAIJDATGVPLPDDTLSVAKQSDAVLLGAIGG : 120 

* 140 
TrMDiEOc : EKQLKPSTGIiIiQIiREGIiQVFANLR : 144 



I 



^^^^-e 86 Consensus contig nucleotide sequence of TrPEPCa 



* 20 ♦ 40 * 60 

TrPEPCa : GNNACATTOCCGAATGCTGCTGAACTAGGGAGTGATTCCCTTGGAGCCTATGTCATCTCT : 60 



* . 80 * 100 * 120 

TrPEPCa : ATGGCCTCAAGTGCAAGCGATGTCCTTGCAGTAGAGCTTTTACAOAAQGATGCACGTCTT : 120 



* 140 * 160 .* 180 

TrPEPCa : ACAGTTTGTGGAGAATTAGOAAGAGCATGTCCGGGTGGAACGCTTCGGGTGGTTCCTCTA : 180 



* 200 * 220 * 240 

TrPEPCa : TTTGAAACTGTGCAAGACCTGAGAGGAGCTQGTGCAGTTATCAGAAAACTTTTATCAATC : 240 



* 260 * 280 * -300 
.TrPEPCa : GATTGGTACCGCCAACACATCATTAAGAACCATAACGGACAdCAAGAGGTTATGGTCGGT : 300 

) 

* 320 * 340 * . 360 
TrPEPCa : TATTCTGATTCTGGTAAAGATGCCGGGCGCTTTACTGCTGCTTGG6AACTTTACAAAGCT : 360 



* 380 * 400 * 420 

TrPEPCa : CAAGAGGATGTAGTGGCTGCTTGCAATAAGTACGATACTAAGGTTACTTTGTTCCACGGC : 420 



* 440 * 460 * 480 

TrPEPCa : CQCGGAGGGAQTATTGGACGTGGCGGAGGCCCAACATATCTGGCTATTCA6TCCCAGCCA : 480 



* 500 * 520 ' * 540 

TrPEPCa : CCTGGCTCTQTGATGGGAACCCTTCGGTCAACTGAGCAGGGAGAGATGGTGCAGGCCGAG : 540 



^ * 560 * 580 * 600 

TrPEPCa : TTTGGGTTGCCACAGACAGCAGTTAGACAACTTGAAATATACACAACAGCTGTGCTACTT : 600. 

» 

* 520 * 640 * 660 

TrPEPCa : GCTACACGTCGTCCACCACTCCCACCTCGAGAAGAAAAATGGCGTAATCTAATGGAAGAC : 660 



* 680 * 700 * 

TrPEPCa ! ATNTCAAAAATCAGTTGTCAGTCCTACCGCAGTGTAGTCTATGAAAATCCAGN- 713 




e 87 Deduced amino acid sec[uence of TrPEPCa 



* 20 * 40 * 60 

TrPEPCa : XTXPlO^AELaspSLGAYVISMASSASDVXiAVELLQKDiUiLTVCGELGI^ 



60 



* 80 * 100 ' * 120 

TrPEPCa : PETVQDIiRGAGAVIRKLLSIDWYRQHIIKNraSTGHQEVMVGYSDSGKDAGRP^ : 120 



* 140 * 160 * . 180 

• TrPEPCa : QEDWAACNKYDTKVTLPHGRGGSIGRGGGPTYliAIQSQPPQSVMGTLRSTEQGEMVQM : 180 



* 200 * 220 * 

TrPEPCa : FGLPOTAVRQlBIYTTAVIilATRRPPLPPREEKWRNI^ : 237 



1 



Ke 88 Nucleotide sequences of nucleic acid fragments contributing to the 
nsus contig sequence TrPEPCa 



* 20 



GIMACATTNCCGAATGCTGCTGAACTAGGGAGTGATTCCCTTGGAGQCTATGTCATCTCT 



TrPEPCal : 

TrPEPCa2 : 

TrPEPCa3 : . " 

* 80 * 100 * 120 

TrPEPCal — — 
TrPEPCa2 
TrPEPCaS 

* 140 * 160 ' • * 180 



TrPEPCal 
TrPEPCa2 
TrPEPCa3 



ATGGCCTCAAGTGCAAGCGATGTCCTTGCAGTAGAGCTTTTHCAGAAiGGATGCACG^ACTT 

GN25CTTTTACAGAAGGATGCACGTCTT 
lAGCTTTTACAGASfGGATGCACGTCTT 



GCgGgiT&lTGGAGAgTTgGGAAGAGCATGTCCgGGTGG^^ 

ACAGTTTGTGGAGAATTAGGAAGAGCATGTCCGGGTGGAACGCTTCGGGTGGTTCCTCTA 
ArAGTTTGTGGAGAATTAGGAAGAGCATGTCCGGGTGG AACGCTTCQGGTGGTTCCTCTA 



119 
27 
26 



179 
87 
86 



1 



TrPEPCal 
TrPEPCa2 
TrPEPCa3 



200 



220 



240 



TTTGAAACTGTGj^GGACCTgAGAGGAGCTGGTilCAGTTATCgG'GA^ 

TTTGAAACTGTGCAAGACCTGAGAGGAGCTGGTGCAGTTATCAGAAAkCTTTTATCAATC 

TTTnaAAPTnTGrrAAGACCTGAGAGGAGCTGGTGCAGTTATC AGT^A^ACTTTTATCAATC 



239 
147 
146 



TrPEPCal 
TrPEPCa2 
TrPEPCa3 



260 



280 



300 



GAg^TGGTACCGp'AACACATCATTAAGAACCAiAAgGGACAgCAAG/^^ 
GATTGGTACCGCCAACACATCATTAAGAACCATAACGGACACCAAGAGGTTATGGTCGGT 
GATTGGTACCGCCAACACATCATTAAGAACCATAACGGACACCAAG^GGTTATGGTCGGT 



299 
207 
206 



TrPEPCal 
TrPEPCa2 
TrPEPCa3 



320 



340 



360 



TATTCTGATTCgGGTAAAGATGCgGGgCGCTT.gACTGCTGCTTGGG?^CTTTACAAAGCT 
TATTCTGATTCTGGTAAAGATGCCGGGCGCTTTACTGCTGCTTGGG^pCTTTACAAAGCT 
TaTTrTaATTrTnGTAAAGATGCCGGGCGCTTTACTGCTGCT TGGG4?^CTTTACAAAGCT 



359 
267 
266 



TrPEPCal 
. TrPEPCa2 
*TrPEPCa3 



380 



400 



420 



CA;GGAGGATGT|5GTiGCTGCTTGCAAT|AiTAgG:GTA^^^ 

CAAGAGGATGTAGTGGCTGCTTGCAATAAGTACGATACTAAGGTTAQTTTGTTCCACGGC 
rQAnAnnATGTAGTGGCTGCTTGCAATAAGTACGATACTAAGGT TAqTTTGTTCCACGGC 



419 
327 
326 



TrPEPCal 
TrPEPCa2 
TrPEPCa3 



440 



460 



480 



CGiGGAGGgAGTATTGGggCGgGGiGGiGGCCCgACATATCTGGCTATjTCAGTCCCAiCCA 
CGCGGAGGGAGTATTGGACGTGGCGGAGGCCCAACATATCTGGCTATTCAGTCCCAGCCA 
rnrnnAnnnAr^TATTGGACGTGGCGGAGGCCCAACATATCTGGCT ATjTCAGTCCCAGCCA 



479 
387 
386 



TrPEPCal 
TrPEPCa2 
TrPEPCa3 



500 



520 



540 



CCTGG"gTCTGTGATGGGAACgCTTCGGTC"gACTGAGCAGGGAG7^TGGTg.;GAGGCCAAG 
CCTGGCTCTGTGATGGGAACCCTTCGGTCAACTGAGCAGGGAGAGATGGTGCAGGCCGAG 
CCTGGCTGTGTGATGGGAACCCTTCGGTCAACTGAGCAGGG AGAGATfGGTGCAGGCCGAG 



539 
447 
446 



560 



580 



60O 



TrPEPCal 
TrPEPCa2 
TrPEPCa3 



TTTGGGTTi^'CCACAGA|i|AGCgGTTAGACAACTTGAi2S| 
TTTGGGTTGCCACAGACAGCAGTTAGACAACTTGAAATATACACAAGAGCTGTGCTACTT 
TTT-r.^^r^TTnrrAPAGACAGCAGTTAGACAACTTGAA ATATACACAAGAGCTGTGCTACTT 



576 
507 
506 



TrPEPCal 
TrPEPCa2 
TrPEPCa3 



620 



640 * 



TrPEPCal : 
TrPBPCa2 : 
TrPEPCa3 : 



ScTACACOTCGTCCACCACTCCCACCTCGAGAAGAAAAATGGCGTAATCTAA 
nPTACACGTCGTCCACCACTCCCACCTCGAGAAGAAAAATGGCGTAATCTAATGGAAGAC 




e 89. Consensus contig nucleotide sequence of TrPBPCb 



* . 20 * • 40 .. * -60 
TrPBPCb : GNAAGGGACAJMSCTCTATCGTACTCGTGAGCGGTCTCGCTATCTCTTAQCTCATGGCTAT : 60 

* 80 ' ■ *■ 100 * ' "° 
TrPBPCb : TCTGAAATTCCTGAAGAAGCCACATTCACCGATGTTGATGA6TTCTTGGAACCTCTTGAA : 120 

. * X40 * 160 * 180 

TrPBPCb : CTATGCTACAGATCACTCTGTOCTTGTGGTGATCGTGCGATTGCCGATGaAAGCCTTCTT : 180 

* 200 * 220 ■ * . 240 
TrPBPCb i GATTTCTTGAGGCAAGTTTCCACTTTTGGACTGTCACTQGTAAGACTTGATATAAGGCAA -. 240 

* 260 * 280 * 300 
TrPBPCb : GAGTCAQATCGTCACACGGACGTGATGGATGCCATTACCAAACATTTGGAAATTQQATCC = 300 



) * 320 * 340 * • 360 

TrPBPCb : TACCAA6A< 



.CTGGTeroAAGAAAAAAQACAGOAATGGCTTTTOTCTGAGTTGGTTGaCAAA : 360 



* 380 •* 400 * 420 
TrPBPCb. : AGGCCGCTTrTTGGACCTGACCTACCTCAAACCaATGAAATTASAGAAGTTTTAGAGACA : 420 

« 440 * 460 * 480 

TrPBPCb : TTTCaTGTCATAGCAGAACTTCCATCAGACAACTTTGGAGCCTATATCATTTCGATGGCA : 480 

* 50O * 520 * 540 
TrPBPCb : ACTGCCCCGTCTGATGTGCTAGCGGTTGAACTTCTTCAACGTGAATGCAAAATCAAGAAT : 540 

* 560 * 580 * 

TrPBPCb : CCGTTAAGAGTTOTTCCQTTGTTTGAaAAACTTGCTGATCTCGAGTCTaCTCCTGCTG : 598 



^^^^e 90 Deduced amino acid • sequence of TrPEPCb 

* 20 . * 40 * 60 
TrPEPCb : xraKLYRTRERSRYIiLJffiGYSEIPEEATFTDVDEFIiBPI-EIiC^ s ^0 

* 80 * 100 * 120 
TrPEPCb : DFLRQVSTFGLSIiVMiDIRQESDRHTDVMDAITKEiLEIGSYQDWSEEKRQEWLLSEL^^^ : 120 

* 140 * 160 * 180 
TrPEPCb : RPLPGPDLPQTDEIREVLETFHVIAELPSDNFGAYIISMATAPSDVIAVELLQRBCK : 180 



TrPEPCb : PLRWPLFEKLADLESAPA : 199 



^^Hpe .91 Nucleotide sequences of nucleic acid .fragments contributing to the 
consensus contig sequence trPEPCb 



20 



40 



60 



TrPEPCbl : 
TrPEPCb2 : 



GMAAGGGACAAGCTCTATCGTACTCGTGAGCGGTCTCGCTATCTCTip.GCTCATGGCTAT 
GSAAGGGACAAGCTCTATCGTACTCGTGAGCGGTCTCGCTATCTCTmGCTCATGGCTAT 



60. 
60 



TrPEPCbl : 
TrPEECb2 : 



80 



100 



120 



TCTGAAATTCCTGAAGAAGCCACATTCACCGATGTTGATGAGTTCT'n]GGAACCTCTTGAA| 
TCTGAAATTCCTGAAGAAGCCACATTCACCGATGTTGATGAGTTCT11GGAACCTCTTGAA 



120 
120 



140 



160 



180 



TrPEPCbl : 
TrPEPCb2 : 



CTATGCTACAGATCACTCTGTGCTTGTGGTGATCGTGCGATTGCCGATGGAAGCCTTCTT 
CTATGCTACAGATCACTCTGTGCTTGTGGTGATCGTGCGATTGCCGATGGAAGCCTTCTT 



180 
180 



TrPEPCbl 
TrPEPCb2 



200 



220 



240 



GATTTCTTGAGGCAAGTTTCCACTTTTGGACTGTCACTGGTAAGACljTGATATAAGGCAA 
GATTTCTTGAGGCAAGTTTCCACTTTTGGACTGTCACTGGTAAGACTTGATATAAGGCAA 



240 
240 



TrPEPCbl 
TrPEPCb2 



260 



280 



300 



GAGTCAGATCGTCACACGGACGTGATGGATGCCATTACCAAACATTTjGGAAATTGGATCC 
GAGTCAGATCGTCACACGGACGTGATGGATGCCATTACCAAACATTT]GGAAATTGGATCC 



300 
300 



320 



34 0 



360 



TrPEPCbl : 
TrPEPCb2 : 



TACCAAGACTGGTCTGAAGAAAAAAGACAGGAATGGCTTTTGTCTGA'GTTGGTTGGCAAA 
TACCAAGACTGGTCTGAAGAAAAAAGACAGGAATGGCTTTTGTCTGAGTTGGTTGGCAAA 



360 
360 



TrPEPCbl 
TrPEPCb2 



380 



400 



420 



aggccgctttttggacctgacctacctcaaaccgatgaaattagagaIagttttagagaca 
aggccgctttttggacctgacctacctcaaaccgatgaaattagag^^gttttagagaca 



420 
420 



440 



460 



480 



TrPEPCbl : 

TrPEPCb2 : 

) 

TrPEPCbl : 

TrPEPCb2 : 



TTTCATGTCATAGCAGAACTTCCATCAGACAACTTTGGAGCCTATATCATTTCGATGGCA 
TTTCATGTCATAGCAGAACTTCCATCAGACAACTTTGGAGCCTATAljCATTTCGATGGCA 



500 



520 



540 



ACTGCCCCGTCTGATGTGCTAGCGGTTGAACTTCTTCAACGTGAATG|CAAAATCAAGAAT 
ACTGCCCCGTCTGATGTGCTAGCGGTTGAACTTCTTCAACGTGAATGCAAAATCAAGAAT 



480 
480 



540 
540 



TrPEPCbl 
TrPEPCb2. 



560 



580 



ccgttaagagttgttccgttgtttgagaaacttgctgatctcgagtqtgctcctgctg 
ccgttaagagttgttccgttgtttgagaaacttgctgatctcgK! 



598 
: 584 



I^^e 92 Consensus contig nucleotide sequence of TrPEPCc 

* 20 * . 40 ' * 60 
TrPEPCc : GTCACATGACAAACNATATCTCCCTTTCTCTAACTCCGTGATCA^ s 60 

* 80 * 100 * 120 
TrPEPCc : CACAAATTGCTGTTAGGTTTCGTTGTACTTTCCCGTGCAATCCATAGTATCTTGGAGGAA : 120 

* 140 * 160* * 180 

TrPEPCc.': CAAACTAQATTTTCCACCTAGGTCGTCACGAGATTTTCCTCTTCACTATITTTCT^ : 180 

* • 200 * 220 * s 240 
TrPEPCc : ATATAATAACTCAACACTTTTTCTAGCTACTTACTAGTACTGTGTAACACAAATTTTATT : 240 



* 260 * 280 * 300 
TrPEPCc : CATTATGGCTACTCCTCGCAACATTGAAAAAATGGCTTCAATTGATGCTCAATTGAGACT : 300 

* 320 * 340 * 360 
TrPEPCc : ACTAGCACCAAGGT^AAGTTTCTGATGATGATAAACTTQTCGAGTATGATGCTTTG^ : 360 



* 380 * -400 * 420 

TrPEPCc : GGATCGATTCCTTGACATTCTTCAAGATTTGCATGGAGAAGATATCiliGACAAACTGTT^ : 420 



* 440 * 460 * 480 

TrPEPCc : AGATTGTTATGAGTTATCGGCAGAGTATGAAGGGGAGCTTAAGCCGGAGAAATTGGAGGA : 480 



* 500 * 520 * 540 

TrPEPCc : ACTTGGGAATATGCTTACTGGTCTTGATGCTGGAGATTCTATTGTTATAGCAAAATCATT : 540 



* 560 * 

TrPEPCc : TTCTCATATGCTTAATTTGGCAAACTTGGCAGAGN : 575 



I 




e 93 Deduced amino acid sequence of TrPEPCc 



* 20 * 40 * 60 

TrPEPCc : MA.TPI^IEK^]ASIDAQLRIJIIAPRKVSDDDKLVEYDALLLDRFLDILQDI^ 



60 



* 80 * 100 * 

TrPEPCc : CYBLSAEYEGEIiKPBKLEELGlinynjTGIjDAGDSIVIAKSFSHMLKnCiAN^ : 110 




[g 94 Nucleotide sequences of nucleic acid fragments contributing to the 
consensus contig sequence TrPEPCc 



20 



40 



60 



TrPEPCcl 
TrPEPCc2 



GTCACATGACj^AC'gATATCTCCCTTTCTCTAACTCCGTGATCAAGdCGTTAGTTAGTTA 
TGACAAACNATATCTCCCTTTCTCTAACTCCGTGATCAAGGCGTTAGTTAGTTA 



60 
54 



80 



100 



CACAAATTGCTGTTAGGTTTCGTTGTACTTTCCCGTGCAATCCATAGTATCTTGGAGG 
CACAAATTGCTGTTAGGTTTCGTTGTACTTTCCCGTGCAATCCATAGTATCTTGGAGGAA 



120 
^ : 120 
114 



TrPEPCcl 
TrPEPCc 2 



140 



160 



180 



CAAACTAGATTTTCCACCTAGGTCGTCACGAGATTTTCCTCTTCACljATTTTTCTTTTTC 

caaactagattttccacctaggtSgtcacgagattttcctcttcaciiatttttctttttc 



180 
X74 



TrPEPCcl 
TrPEPCc2 



200 



220 



240 



atataataactcaacactttttctagctacttactagtactgtgtaacacaaattttatt 
atataataat,tcaacactttttctagctacttactagtactgtgta4cacaaattttatt 



240 
234 



TrPEPCcl 
TrPEPCc2 



260 



280 



300 



CATTATGGCTACTCCTCGCAACATTGAAAAAATGGCTTCAATTGATGCTCAATTGAGACi 
CATTATGGCTACTCCTCGCAACATTGAAAAAATGGCTTCAATTGATGCTCAATTGAGACT 



300 
294 



TrPEPCcl 
TrPEPCc2 



320 



340 



360 



ACTAGCACCAAGGAAAGTTTCTGATGATGATAAACTTGTCGAGTATGATGCTTTGTTATT 
ACTAGCACCAAGGAAAGTTTCTGATGATGATAAACTTGTCGAGTATQATGCTTTGTTATT 



360 
354 



TrPEPCcl 
TrPEPCc 2 



380 



400 



420 



GGATCGATTCCTTGACATTCTTCAAGATTTGCATGGAGAAGATATCAGACAAACTGTTCA 
GGATCGATTCCTTGACATTCTTCAAGATTTGCATGGAGAAGATATCAGACAAACTGTTCA 



420 
414 



TrPEPCcl 
TrPEPCc2 



440 



460 



480 



agattgttatgagttatcggcagagtatgaaggggagcttaggccgdagaaattggagga 
agattgttatgagttatcggcagagtatgaaggggagcttaagccggagaaattggagga 



480. 
474 



TrPEPCcl 
TrPBPCc2 



500 



520 



540 



acttgggaatatgcttactggtcttgatgctggagattctattgttatagcaaaatcatt 
acttgggaatatgcttactggtcttgatgctggagattctattgtt^tagcaaaatcatt 



540 
534 



TrPEPCcl 
TrPEPCc2 



560 



ttc tcatatgcttaatttggcaaacttggcagagnI 



^^^^jie 95 Nucleotide sequence of TrPEPCd 

* 20 * 40 * 60 
TrPEPCd : AGAAGATCTCATGTTTGAGTTGTCTATGTGGaK:TGCAAC^ i 60 

* 80 * 100 - ' ** 120 
TrPEPCd : TGAAGAGCTTCATAGATCCTCAAAGAAAGATGCAAAACATTATATTGAGTTTTGGAA^ : 120 

* 140 * 160 * 180 
TrPEPCd :" GATTCCTCCAAACGAGCCATATCGTGTTATTCTTGGAGGTGTGAGGGACAAACTGTATAA : 180 



* 200 * 220 * 240 

TrPEPCd : TACACGTGAACGTGCTCGACAGTTATTAGCAAATGGAACCTCTGACATCCTTGAAGAGAC : 240 



* * 260 * 280 * 300. 

TrPEPCd : AACCTTCACGAATGTTGAGCAGTTTCTGGAGCCTCTTGAACTGTGTTATAGGTCACTTTG : 300 

* 320 * 340 * 360 

TrPEPCd : TGCATGTGGTGACCGATCAATAGCAGACGGTU^GCCTTCTTGATTTCTTGCGACAAGT^ : 360 



* 380 * . 400 * 420 

TrPEPCd : TACATTTGGACTTXCACTTGTAAGACTCGACATCCGTCAAGAGTCAGACAGGCACACAGA : 420 



* 440 * 460 * 480 
TrPEPCd • ' CGTTATGGATGCAATTACAAAACACTTGGAGATTGGATCTTACCGAGAATGGTCGGAAGA : 4 80 

* 500 * 520. * 540 
TrPEPCd : ACGCAGGCAGGAATGGCTCTTGTCTGAGCTTAGTGGAAAACGCCCTCTCTTCGGCCATG^ : 540 



* 560 * 5*80 * 600 

TrPEPCd : TCTTCCTAAGACAGAAGAAATTGCCGATGTTTTAGATACCTTNCACGTN^ : 600 



* 620 * " 640 * 660 
TrPEPCd : TNCCTCANATAGCTTTGGTGCCTATATGATCTCAATGGCAACCTCCCCATCXGATGTQCT : 660 

* 680 * 700 * 720 
TrPEPCd : AGerGTCGAGCTTTTACAACGTGAATGTCATGTGAAGCAGCCGTTAAKAGTTGTTCCACT : 720 



* 740 * 760 * 780 

TrPEPCd : GTTTGAAAAGCTCGCCNGTCTTGAGTCTGCTCCTGCTGCGGNAGCGCQTiUU"iUl^l"i'A^ : 780 



* 800 * 820 * 840 

TrPEPCd : TTGQGNCANAACCGNNNTAATGGaU^GCAGAAGTTNTGATAG s 840 



TrPEPCd : AGATGCTGGCCGNK : 854 



!^^^Pe 96 Deduced amino acid sequence of TrPEPCd 

* 20 * 40 * 60 

TrPEPCd : EDLMFBIiSMWRCiroBIiRVR?VBEIJmSSKiaDAKHYIEFWKQlPP : 60 

* * 80 * 100 * 120 

TrPEPCd : TRERARQLLANGTSDILEETTFTNVEQPLBPLELCYRSLCACGDRSIADGSLLDFIiRQVS : 120 



* 140. ' * 160' * 180 
TrPEPCd : TPGLSLWLDIRQESDRHTDVMDAITKHLEIGSYRBWSEERRQEWLLSEIiSGTO^ : 180 

* 200 * 220 * 240. 
TrPEPCd : LPKTEBIMVLDTXHXISXLXSXSFGAYIISMATSPSDVLAVEIJ^ 240 

I 

* 260 * 280 
TrPEPCd ! FEKIMLBSAPAAXJUlFXIiDWXXTXXMBSRSXDRySXXGKDAGX : 2 83 



^^^^e 91 Nucleotide sequence of TrPBFCe 

.* 20 * 40 * 60 

TrPEPCe t GTTCACTOTCTCTCTtaffCCAATTTrCCTCCCTTGTCTTCTT^^ : 60 



* 80 * ■ 100 * 120 

TrPEPCe : TCTTACTGCCTCATTACACGGGTGAGAAGGAGTGAATTGCTCCAATGGCAACAAACAAAA : 12 0 



* 140 * 160 * 180 
TrPEPCe : TGGAAM^TOGCATCAATTGATGCACAGCTTAGACyATTAGTACC;^ : 180 

* 200 * 220 * 240 
TrPEPCe : AAGATGATAAACTTATTGAGTATGATGCTTTGTTGTTGGATCGGTTTCTTGATATCCTTC : 240 



♦ . 260 * 280 * 300 

TrPEPCe : AGGATTTACATGGAGAGGATCTGAAAGATTCTGTTCAAGAAGTGTATGAACTTTCTGCGG : 300 

J * 320 * 340 * 360 

TrPEPCe : AGTATGAAAGAAAGCATGATCCTAAGAAACTTGAAGAGCTCGGAAATTTGATAACAAGTT : 360 



* 380 * 400 * 420 

TrPEPCe : TAGATGCAGGAGATTCAATTGTTGTTGCTAAGTCCTTTTCGCACATGCTTAACTTGGCCA : 420 



* 440 * 460 * * 480 

TrPEPCe : ACTTAGCTGAAGAGGTTCAGATTGCTCATCGTCGAAGGAACAAGTTGAAGAAAGGAGATT : 480 



* 500 * 520 * 540 

TrPEPCe : TTAGGGATGAGAGCAATGCAACTACCGAATCAGACATCGAAGAAACTCTTAAGAQACT : 540 



* 560 * 580 * 600 

TrPEPCe : TGTTTAATATGAAGAAATCTCCTCAGGAAGTTNTTGATGCGTTGAAGAACCKMACCGTTO : 600 



* 620 . * 640 * 660 

TrPEPCe : ATTTGGTTCTTACTGCTCATCCCACTCAGTCCGTTCGANGNCCaTCTGCTTCCCNN^ : 660 



* 680 * 

TrPEPCe : GGNACGGGNACCGCNCTGNCTATCNNACTGNNN : 693 



I^l^^e 98 Deduced amino acid sequence of TrPBPCe 

* 20- * 40 * 60 
TrPEPCe : MATNKMBKMASIDAQIiRQLVPAKVSEDDKLIEYDALLI.DRPliDILQDI^ : 60 

* 80 * 100 * 120 
TrPEPCe : YELSAEYERKHDPKKLEELGNLITSLDAGDSIWAKSFSHMIjNIAIS^^ : 120 



* 140 * 160 * 180 

TrPEPCe : LKKGDPRDESN^LTTESDIEETLKMiVFNMKKSPQEVXDA^ : 180 



TrPEPCe : LLPXAWXGXRXXVXTX : 196 



i^^^e 99 Consensus contig nucleotide sequence of TrCSa 

* 20 * 40 * T* 60 

TrCSa : casmNCSICaTACCATTACATTAATNACACTTTCCNC s 60 

. * ' 80 • . * 100 * 120 

TrCSa : TATAAAGACCAATTCAATTCCCAATTCTTTTGGATCCGAAATCATTCATTCTACGCGTCT : 120 



* . 140 * ■ . 160 * 180 
TrCSa ! TCrCTCTTCTCTGCGTTTCAAACCCTAGTTGTTTTGTTGATTGATCTAAATGGCGCT : 180 

* 200 * 220 * 240 
TrCSa : TCGAAGCGTTTCTGCGCTTTCAAAACTACGATCTCGTGTGGGTCAACAACCTAGTCTTGC : 240 



* 260 * 280 * 300 
TrCSa : TAATTCAGTTAGATGGCTCCAAACTCCAAGCTCCAGTAACACTGATCTTTATTCTGAGA : 300 

* 320 * 340 * 360 
TrCSa : GAAGGAGCTAGTTCCAGAGTATCAGGAACGTQTPTAAGAAGTTGAAGAAAQACCATGGAAG : 360 



* 380 * 400 * 420 
TrCSa : TGTTGAATTGGGAAAAATCACAGCTGATATGGTACTTGGTGGAATGAGAGGAATGACTGC : 420 

* 440 * 460 * 480 
TrCSa : TTTAGTGTGGCTAGGCTCAGCTGTTGACCCAGATGAGGGAATTCGCTTTAGGGGCATGA : 480 

* 500 * 52 0 * . 540 
TrCSa • : AATTCCTGACTGCCAGAAAACACTTCCAGGTGCTTTTCCTQGTGGGGAGCCTT^ : 540 

* 560 * 580 * 600 
TrCSa : GGCTATACTGTGGCTTCTATTGACCGGAAAGGTACCAAGTAAAGAGCAA6TAGATTCATT : 600 

* 620 * 640 * ' . * 660 
TrCSa : AGCTCACGAATTGCGAAGTCGTGCAAAAATCCCAGAGTATGCTTACAAGGCAATTOATGC : 660 

* 680 * 700 * 720 
TrCSa : ACTGCCTGTTTCTGCTCATCCAATGACACAATTTAGTACTGGTGTAATGGCCCTC^ : 720 



* 740 * 760 * 780 

TrCSa : GGAGAGTGAGTTTACAAAGGCATACGAGAGTGGGATACATAAGTCAAGGTATTGGGAGCC : 780 



* 800 * 820 * 840 

TrCSa : AACTTATQAGGATAGCTTGAATTTAATTGCTCGTTTGCCTGGAATTGCTO 840 



* 860 * 880 * 900 

TrCSa : TCGACGQATATACAAGGATGGAAA7VATCATACCATTGGATGATTCTTTGGATTATGGTGC : 900 



* 920 * 940 * 960 

TrCSa : AAACTATGCTCACATGTTAGGATTTGATGATCCAGAAACGCTGGAGTTTATGAGGCTGTA : 960 




* .980 * 1000 • * 1^20 
T?CS^ : TATTTCTATCCATAGTGATCATGAAGGNGGCAACGTTAGTTCTCACAC^ : 1020 

* 1040 ' * 1060 * 1080. 
TrCSa : TGCTAGTTCACTATCAGATCCTTATCTTQCAlTCGCaV^ • 1080 



-k 1100 * 1120 * 1140 

TrCSa : CCCACTGCATGGTTTAGCCAATCAGGAAGTTCTACGATGQATCAGAAACAT^^ 



: 1140 



* 1160 * 1180 * 1200 
TrCSa : GTTTGGAACTCCAAACa^TAAGTAaVGAACAATTGAGCGACTACATC : 1200 

* 1220 * 1240 * 1260 
TrCSa : aVGTGGCCa^GGTTGTGCCTGGATATGGACATGGAGTTTTGOT 1260 

* 1280 * 1300 

TrCSa : CACTTGCCAGAGGGAGTTTGCATTGAAGCATTTGCCTAATOATCCAN : 1307 



^^^^e 100 Deduced amino acid sec[ueiice of TrCSa 

* 20 * 40 * 60 
TrCSa : MAFFRSVSALSKLRSRVGQQPSIJUffSVRWMTPSSSNTDLYSEMI^ : 60 

* 80 * 100 . *■ 120 
TrCSa : DHGSVELQKITM)MVLGGrmGMTALVWLGSAVDPDEGIRFRGMTIPDCQKTLPGAF : 120 



* 140 * 160 ' * 180 
TrCSa : PLPEAILWLI^LTGKVPSKEQVDSWUJELRSRAKIPEYAYK^ s 180 

* 200. * 220 * 240 
TrCSa : ALQVESEFTKAYBSGIHKSRYWEPTYEDSLNLIARIiPGIAAYiyRRIYKDQKIIPI^^ : 240 



* 260 * 280 * 300 
TrCSa DYGANYAHMIiGFDDPETLEFMRIiYlSIHSDHBGNVSSHTMLVASSIiSDPYI^ : 300 

* 320 * 340 * 360 
TrCSa : LAGPLHGI^QEVLRWIRNIVKEPGTPNISTEQliSDYIHKTIJffSGQV^ : 360 



TrCSa : PRYTCQREFALKHIjPNDP : 378 




h 101 Nucleotide sequences of nucleic acid fragments contributing to the 
^OTsensus contig sequence TrCSa 



40 



60 



TrCSal 

TrCSa2 : 

TrCSa3 : 

TrCSa4 : 

TrCSaS s 

TrCSa6 : 

TrCSa? : 



GNNNCNCNACCATTACiTTAATiACAiTTTCiNCTTTCGCCTTGTTCTT^CTC 



60 
45 



100 



120 



TrCSal : 

TrCSa2 : 

TrCSa3 : 

TrC8a4 : 

TrCSaS : 

TrCSa6 : 

TrCSaV : 



TrCSal : 

TrCSeL2 : 
TrCSa3 : 
TrCSa4 : 
TrCSaS : 
TrCSa6 : 
TrCSa7 : 




160 



180 



TCTCTCTTCTCTGCGTTTCAAACCCTAGTTGTTTTGTTGATTGATCT(|A^TGGCGTTCTT| 
TCTC^CTTCTCTGCGTTTCAAACCCTAGTTGTTTTGTTGATTGATCTIljfikTGGCGTTCTT 

tctIscttctctgcgtttcaaaccctagttgttttgttgattgatcta^Jatggcgttctt 

giSS^STS icBTTT B^ACCCTHGTTGTTTiGTTGATTGATCTA^TGGCGTTCT^ 



120 
104 
40 
10 



180 
164 . 
100 
67 



TrCSal 
TrCSa2 
TrCSa3 
TrCSa4 
TrCSaS 
TrCSa6 
TrCSa? 



200 



220 



240 



^CGAAGCGTTTCTGCGCTTTCAAAACTACGATCTCGTGTGGGTCAACAACCTAGTCTTGC 
TCGAAGCGTTTCTGCGCTTTCAAAACTACGATCTCGTGTGGGTCAACAT^CCTAGTCTTGC 
TCGAAGCGTTTCTGCGCTTTCAAAACTACGATCTCGTGTGGGTCAACAi^CCTAGTCTgGC; 
TCGAAGCGTTTCTGCGCTTTCAAAACTACGATCTCGTGTGGGTCAACA^CCTAGTCTTGC 



240 
224 
16-0 
127 



TrCSal 
TrCSa2 
TrCSa3 
.TrCSa4 
TrCSaS 
TrCSa6 
TrCSa? 



260 



280 • 



300 



TAATTCAGTTAGATGGCTCCAAACTCCAAGCTCCAGTAACACTGATCTTITATTCTGAGAT 
TAATTCAGTTAGATGGCTCCAAACTCCAAGCTCCAGTAACACTGATCTT|TATTCTGAGAT 
TAATTCAGTTAGATGGCTCCAAACTCCAAGCTCCAGTAACACTGATCTHTATTCTGAGAT 
TAATTCAGTTAGATGGCTCC AAACTCCA AGCTCCAGTAACACTGATCTllTATTCTGAGAT 



300 
284 
220 
18? 



TrCSal 
TrCSa2 
TrCSa3 
TrCSa4 
TrCSaS 
TrCSae 
TrCSa? 



340 



360 



GAAGGAGCTAGTTCCAGAGTATCAGGAACGTGTTAAGAAGTTGAAGAAAGACCATGGAAGl 
GAAGGAGCTAGTTCCAGAGTATCAGGAACGTGTTAAGAAGTTGAAGAA^GACCATGGAAG 
GAAGGAGCTAGTTCCAGAGTATCAGGAACGTGTTAAGAAGTTGAAGAA^^GASCATGGAAG 



360 
344 
280 
24? 



TrCSal 
TrCSa2 
TrCSa3 
TrCSa4 
TrCSaS 
TrCSa6 
TrCSa7 



380 



400 



420 



TGTTGAATTGGGAAAAATCACAGCTGATATGGTACTTGGTGGAATGAGAGGAATGACTGC 
TGTTGAATTGGGAAAAATCACAGCTGATATGGTACTTGGTGGAATGAGAGGAATGACTGC 
TGTTGAATTGGGAAAAGTCACAGCTGATATGGTACTTGGTGGAATGAGJ^GGAATGACAGCI 
iTGTTGAATTGGGAAAAATCACAGCTGATATGGTACTTGGTGGAATGAGAGGAATGACTGC 
~np;TgG^RAAAggiACAGCTGATATG GTACTTGGTGGAAT GAGAGGAATGACTGC 

-'^ Ignagaggaatgactgc 



420 
404 
340 
307 
51 
16 



TrCSal 
TrCSa2 
TrCSaS 
TrCSa4 
TrCSaS 
TrCSaS 
TrCSaV 



440 



460 



480 



TTTAGTGTGGCTAGGCTCAGCTGTTGACCCAGATGAGGGAATTCGCTTT|AGGGGCATGAC| 
TTTAGTGTGGCTAGGCTCAGCTGTTGACCCAGATGAGGGAATTCGCTT1}AGGGGCATGAC| 
TTTAGTGTGGCTAGGCTCAGCTGTTGACCCAGATGAGGGAATTCGCTTT]AGGGGCATGAC, 
TTTAGTGTGGCTAGGCTCAGCTGTTGACCCAN>TGAGGGAATTCGCTTTAGGGGCATGAC 
TTTAGTGTGGCTAGGCTCAGCTGTTGACCCAGATGRGGGAATTCGCTTT]AGGGGCATGAC 
^^^„_^^^^^^^^«^o^a-«?,^r.r-T-n'rTr;aprrAnATnAGGGAAT TCGCTT'l]AGGGGCATGAC 



480 
464 
400 
367 
111 
74 



520 



540 



TrCSal 
TrCSa2 
TrCSa3 
'ErCSa4 
TrCSaS 
TrCSa6 
TrCSa7 



AATTCCTGACTGCCAGAAAACACTTCCAGGTGCTTTTCCTGGTGGGGAQCCTTTGCCCGA 
AATTCCTGACTGCCAGAAAACACTTCCAGGTGCTTTTCCTGGTGGGGAGCCTTTGCCCGA 
AATTCCTGACTGCCAGAAAACACTTCCAGGTGCTTTTCCTGGTGGGGAGCCTTTGCCCGA 
AATTCCTGACTGCCAaAAAACACTTiCAGGTGCTTTTiCTGG'glGGGGAGllCTTTGaCClA 

aattcctgactgccagaaaacacttccaggtgct|ttcctggtggggagcctttgcccga 
aattcctgactgccagHaaacacttccaggtgcttttcctggtggggagcctttgcccga 



540 
524 
460 
427 
171 
133 



580 



600 



TrCSal 
TjrCSa2 
TrCSa3 
TrCSa4 
TrCSaS 
TrCSa6 
TrCiSa7 



ggctatactgtggcttctattgaccggaaaggtaccaagtaaagagcaa^gtagattcatt 
ggctatactgtggcttctattgaccggaaaggtaccaagtaaagagcas^gtagattcatt 

GGCTATACTGTGGCTicgATTGACCGGAAAGGTACCAAGTAAAGAGCA^jGTAGATTCATTi 

GGCTATACTG§GGi?TTgTATTGACCGGj]S 

GGCTATACTGTGGCTTCTATTGACCGGAAAGGTACCAAGTAAAGAGCA^^GTAGATTCATT 

p;r;r-TaTArTGTGGCTTCTATTGACC GGAAAGGTACCAAGTAAAGAGCAA] GTAGATTCATT 



600 
584 
520 
456 
231 
193 



TrCSal 
. TrCSa2 
• TrCSa3 
TrCSa4 
TrCSaS 
TfCSa6 
TrCSa7 



620 



640 



660 



aatcccagagtatgcttacaag;gcaattgatgc 




AGCTCACGAATTGCGAAGTCGTGCAAAAATCCCAGAGTATGCTTACAAG|GCAATTGATGC 
anPTrarnaATTnrnAAGTCGTGCAAAAATCCCAGAGTATGCTTACAAG.GCAATTGATGC 



660 
588 
580 

291 
253 



TrCSal 
TrCSa2 
TrCSa3 
TrCSa4 
TrCSaS 
TrCSa6 
TrCSa7 



6B0 



700 



720 



lACTGCCTGTTTCTGCTCATCCAATGACACAAaNjl 



PGCCTGTTTCTGCTCATCCAATGACACAATTTAGTACTGGTGTAATG'PCCCTCCAGGT 



ACTGCCTGTTTCTGCTCATCCAATGACACAATTTAGTACTGGTGTAATG'pCCCTCCAGGT 
ACTGCCTGTTTCTGCTCATCCAATGACACAATTTAGTACTGGTGTAATG.GCCCTCCAGGT 



TrCSal 
TrCSa2 

TrCSa4 
TrCSaS 
trCSaS 
TrCSa? 



740 



760 



780 



GGAGAGTGAGTTTACAAAGGCATAiGAGAGTGGGATACATg 



GGAGAGTGAGTTTACAAAGGCATACGAGAGTGGGATACATAAGTCAAGSTATTGGGAGCC^ 
GGAGAGTGAGTTTACAAAGGCATACGAGAGTGGGATACATAAGTCAAGGTATTGGGAGCC 

ISSiiAGAGTGGGABN.CWTBAGTBAAGSBATTGGGAGCC 



68X 

41X 
373 
34 



8Q0 



820 



840 



TrCSal ^ 

TrCSa2 

TrCSa3 

TrCSa4 

TrCSaS 

TrCSa6 

TrCSa7 



TrCSal 
TrCSa2 
TrCSa3 
TrCSa4 
TrCSaS 
trCSae 
TrCSa7 



TrCSal 
TrCSa2 
TrCSa3 
TrCSa4 
TrCSaS 
TrCSaS 
TrCSa7 



TrCSal 
, TrCSa2 
TrCSa3 
TrCSa4 
trCSaS 
' TrCSae 
TrCSa7 



TrCSal 
TrCSa2 
trCSa3 
TrCSa4 
TrCSaS 
TrCSa6 
TrCSa7 



Sac^^Sgatagcttgaatttaattgctcgtttgcctggaattgctgcctatattta 

AACTTATGAGGATAGCTTGAATTTAATTGCTCGTTTGCCTGGAATTGCTGCCTATATTTA 
BACTTATGfiGGAxBGCTTGAATTTAATTGCTCGTTTGCCTGGAATTGCTGCCTATATTTA 



TCGACGGATATACAAGGATGGAAAAATCATACCATTGGATGATTCTTTGGATTATGGTGC 
TCGACGGATATACAAGGATGGAAAAATCATACCATTGGATGATTCTTTGGATTATGGTGC 
TCGACGGATATACAAGGATGGAAAAATCATACCATTGGATGATTCTTTGGATTATGGTGC 



a^c^Stgctcacmgttaggatttgatgatccagaaacgctggagttitatgaggctgta 

AAACTATGCTCACATGTTAGGATTTGATGATCCAGAAACGCTGGAGTTTATGAGGCTGTA 

aaactatgctcacatgttaggatttgatgatccagaaacgctggagtttatgaggctgta 



tatttctatccatagtgatcatgaaggk^^^BEEBBBSBSS^B^BBBEDESSESB 

TaTTTCTATCCATAGTGATCATGAAGGjGGGAACGTTAGTTCTGAGACgjGCTCACCTAGT 



TrCSaS 
TrCSa6 
TrCSa? 



CCCACTGC 



392 



1160 



1180 



1200 



TrCSal 
TrCSa2 
TrCSa3 
TrCSa4 
TrCSaS 
TrCSaS 
TrCSa? 



.^^^^^,.J;^^.^...^^7^o7^r^-7^z.r^BnTT^ar!rGACTACATTCA^?AAAACATTGAA 



3TTTGGAA 



452 



1220 



1240 



1260 



TrCSal 
TrCSa2 



TrCSa3 
TrCSa4 
TrCSaB 
TrCSa6 
TrCSa7 



CAGTGGCCAGGTTGTGCi 



.^^^^^^^o^.^r-^Tlr-.r;ar!TTTTnCGCAA TACAGACCCAAGATfl 



512 



1280 



1300 



TrCSal 
TrCSa2 
TrCSa3 
TrCSa4 
TrCSaS 
TrCSa6 
TrCSa? . 



^^^^^^^^^^„^^.n^^.^TTr-,raTTnAAGCA TTTGCCTAATGATCCAI>j 



559 




!e 102 Consensus contig nucleotide sequence of TrCSb 



* ■ 20 * 40 ♦ • 60 

TrCSb : CNTTTCNTTTCCACyMSCATCCTAATCC^TCCTAATCC^ 



60 



80 * 100 * 120 

TrCSb ATTACTAATTACEVGTACTAATTAGTAAT^ACCGATCCCTTTTTCTGGJ^CCaVTO : 120 

X40 * 160 * ' 180 

TrCSb : AAGMAGAAGAAGGAAAAACAAAATCCACACAAAOVAACATCTTACAACAATGTCAACGAC : 180 



200 * -220 * 240 

TrCSb :. AACTACTACAACCGA 



CQAATCCAAGCTGCACGACGCTGCACGGAACCGTTTGGCCACCCT : 240 



• * • 260 * 280 * 300 

TrCSb : CTCAGCTCACTTGCTTCCTTCCTCCACAACCTCCOCOGCGCTCCTCCATCCTATTCACCT : 300 

I * • 320 * 340 ♦ 360 

TrCSb s TTCTTCTTCCTCCGGQATCTCCCCACCGTCTAATGTCAAAGQAACACTCACCGTTGTTGA.: 360 



380 * 400 * 420 

TrCSb : TGAACGTACCGGGi 



iAAGAAGTATACCATTGAGQTCTCTCCTGATGGCACCGTTAAAGCCAA : 420 



* 440 * .460 * 480 

TrCSb : TGATTTCAA3AAGATA' 



.TCAACTGGGAAGAAT6ATAASGGACTCAAACTTTATGATCCTGG : 480 



* • 500 * 520 * 540 
TrCSb : aTATTTAAACACTGCTCCTGTGCGATCAACAATTTCTTATATTGATGGTGATGAGGGAAT : 540 

* 560 * 580 * 600 
TrCSb : CCTTAGATATAGAGGATACCCCATTGAGGAGTTGGCCGAGAAAAGCACCTTTCCaGAAGT : 600 

i * 620 * 640 * 660 

TrCSb : GGCATATCTCATATTGTATGGAAATTTGCCTTCTGCAAATCAGTTACAAGAATGGGAATT : 660 

* 680 * 700 * 720 
TrCSb : TGCTATATCTCAQCATTCAaCCTTACCTCAAGGAGTTTTGGATCTCATACAATCAATGCC : 720 



740 * 760 - * 780 

TrCSb : T 



CAAGATGCACATCCTATGGGCGTCCTAGTGAATGCAATAA3CGCTCTGTCT6TTTTTCA : 780 



* 800 * 820 . * 840 
TrCSb : TCCTGACGCAAATCCTGCTCTCAGAGGTCTTGACATCTACAACTCAAAGCAAQTOAGAOA ; 840 

* 860 * 880 * • 900 
TrCSb : CAAACAAATAaCACGOATTATTGOAAAQATAACAACAATTGCTGCTGCAATTAATCTtAG : 900 

920 * 940 • * 960 

TrCSb : AATGOCAGGAAGGCCACCTOTGCTTCCATCCAACAAACTATCTTACACAGAOAACTTCCT : 960 




« 980 * 1000 * 1020 

_ - - — — — 

TrCSb : ATACA' 



.TGCTTCmTTCTCTAGSCAATCGGTOVTATAAACCCAACCCTCAGCTAAClX^ : 1020 



* 1040 * • 1060 * 1080 

TrCSb : ACTAaACATCATCTTCATCCTOCATGaMaAACATQAAATOAAT^^ 1080 



* 1100 * 1120 * 1140 

'TrCSb • : ACGACACCTTGCA' 



.TCAAGCGQCOTCGATGTATACACIGCTATTGCTGGAGGTGTTOaAQC : 1140 



1160- * 1180 * „:__^??£ 

TrCSb : TCTGTi 



■ATQQACCtCTTCATGOTQGAGCTAATGAQGCGGTCCTTAAAATGCTGAGTGAAAT : 1200 



* 1220 * 1240 

TrCSb 



TGGAAGTGTCQATAACATTCCAaAGTTCATTGAAGGTGTTAANM. : 1244 




103 Deduced amino acid sequence of TrCSb 



20 



40 * 60 



TrCSb : MSTTTTTTDESKLHDAARNKIATLSJ^1.PSSTTSAALLHPIHLSSSS^^ « 6° 

80 • * • * 120 • 

TrCSb : TVVDERTGKKJTTIEVSPDGTVKAlTOFKiaSTGKNPKGLKLYDPGyi^^ 120 

140 * 1«0 * 

TrCSb : DE01I.RyRGYPIEBIJ^KSTPPEVAyi.II*yGNLPSANQLQBWEPAISQHSAl.PQGVLDLl s 180 



* 200 • 

Ircsb : qsmpqdahpmqvlvnaisalsvfhpd; 



220 * 240 



iJUJPALRGIiDIYNSKQVRDKQIARIIGKITTIAAA : 240 



* 260 * 280 * 

TrCSb : INLRMAGRPPVI.PSim.SyTElIFI.YmJ3SLGimSYKPNPQLTRAIJ3IIPII.HAE™ : 300 



) * 320 * ■ 340 

TrCSb I TSAVRHLASS( 



iGVDVYTAIAGGVGALYGPmGGANEAVLroJLSEIGSVDNIPBPIBQVX s 3S8 



I 



I 




_le 104 Nucleotide sequences of nucleic acid fragments contributing to the 
consensus contig sequence TrCSb 



TrCSbl 
TrCSb2 
*rCSb3 
TrCSb4 
TrCSbB 
■frCSbS 
TrCSb7 



20 



40 



60 



60 



TrCSbl 
TrCSb2 
TrCSb3 
TrCSb4 
TrCSbS 
•TrCSb6 
TrCSb? 



) 



80 



100 



120 



Tar-TaaTTAfTAnTACTAATTAGTAATA CCGATCCCTTTTTCTCGAACCCATTCATTC 



180 



TrCSbl : 
TrCSb2 
TrCSb3 

TrCSb4 : ! 

TrCSbS 
TrCSb6 : 
TrCSb? : 




120 



179 
55 
58 
50 
45 



TrCSbl 
TrCSb2 
TrCSb3 
TrCSb4 
TrCSbS 
TrCSb6 
TrCSb? 



I 



TrCSbl 
TrCSb2 
TrCSb3 
TrCSb4 
TrCSbS 
TrCSb6 
TrCSb? 



220 . 



240 



AACTACTACAACCGACGAATCCAAGCTGCACGACGCTGCACGGAACCGTrTTGGCgACCCT 
AACTACTACAACCGACGAATCCAAGCTGCACGACGCTGCACGGAACCGTTTGGCCACCCT| 
AACTACTACAACCGACGAATCCAAGCTGCACGACGCTGCACGGAACCGTTTGGCCACCCT 
AACTACTACAACCGACGAATCCAAGCTGCACGACGCTGCACGGAACCGTTTgGCCACCCT 
CT- CT" " r-r^r-rv r-r^TN 7^-^r-r^an^;r'Tr!ParGACGCTGCACGG AACCG^iTTGGCgjACCCT 



260 



280 



300 



CTCAGCTCACTTGCTTCCTTCCTCCACAAgCTCCGCiGCGCTgCTCCATCCTATgCACCT| 
CTCAGCTCACTTGCTTCCTTCCTCCACAACCTCCGCCGCGCTCCTCCATCCTATTCACCT 
CTCAGCTCACTTGCTTCCTTCCTCCACAACCTCCGCCGCGCTCCTCCATCCTATTCACCT 
CTCAGCTCACTTGCTTCCTTCCTCCACAACCTCCGCCGCGCTCCTCCATCCTATTCACCT 
icTCAGCTCACTTGCTTCCTTCCTCCACAAiCTCCGCiGCGCTiG.CTCCATCClATgCACCT 



239 
115 
118 
110 
105 



299 

175 
178 
170 
165 



340 



.360 



TrCSbl 
TrCSb2 
*rCSb3 
TrCSb4 
TrCSbS 
TrCSbe 
trCSb? 



TTCTTCTTCCTCiSGGGATCTCCCCACCGTCTAATGTCAAAGGAACACTGACCGTTGTTGA 
TTCHCTTCCTCCGGGATCTCCCCACCGTCTAATGTCAAAGGAACACTGACCGTTGTTGA 
TTc'gicTTCCTCCGGGATCTCCCCACCGTCTAATGTCAAAGGAACACTGACCGTTGTTGA 
TTCTTCTTCCTCCGGGATCTCCCCACCGTCTAATGTCAAAGGAACACTGACCGTTGTTGA 
iTTCTTCTTCCTCglGGGATCTCCCC ACCGTCTAATGTCAAAGGAACACTgACCGTTGTTGA 



359 
235 
238 
230 
225 




TrCSbl 
TrCSb2 
TrCSb3 
TrCSb4. 
TrCSbS 
TrCSb6 
TrCSb? 



400 



420 



TGAACGTrcCGGGAAGAAGTATACCATTGAGGTCTCTCCTGATGGCACCfeTTAAAGCCAAl 
TrAACGTACCGGGAAGAAGTATAScATTGAGGTCTCpCCTGATGGCACCGTTAAAGCCAA 
TGAACGTACCGGGAAGAAGTAT?^CATTGAGGTCTC|JCCTGATGGCACCGTTAAAGCCAA 
TGAACGTACCGGGAAGAAGTATACCATTGAGGTCTCTCCTGATGGCACCpTTAAAGCCAA 



419 

295 
298 
290 
285 



TrCSbl 

TrCSb2 
TrCSb3 
TrCSb4 
TrCSbS 
TrCSb6 
TrCSb? 



440 



460* 



480 



TGATTTCAAGAAGATATCAACTGGGAAGAATGATAAGGGlCTCAAACTTTATGATCCTGG 
TGATTTCAAGAAGATATCAACTGGGAAGAATGATAAGGGACTCAAACTTTATGATCCTGG 
TGATTTCAAGAAGATATCAACTGGGAAGAATGATAAGGGACTCAAACTTTATGATCCTGG 
iTGATTTCAAGAAGATATCS'ACTGGGAAGAATGATAAGGGACTCAAACTTTATGATCCTGG 
U.^^.rnr.7.n-z.nr.AT-ZVTrAACTGGGAAGAATG ATAAGGG!3,CTCAA ACTTTATGATCCTGG 



479 
355 
358 
350 
345 



TrCSbl 
T^CSb2 
TrCSb3 
trCSb4 
TrCSb5 
TrCSb6 
TrCSb? 



520 



540 



ATATTTAAACACTGCTCCTGTGCGATCAACAATTTCTTATATTGATGGTGATGAGGGAAT 
ATATTTAAACACTGCTCCTGTGCGATCAACAATTTCTTATATTGATGGTGATGAGGGAAT 
ATATTTAAACACTGCTCCTGTGCGATCAACAATTTCTTATATTGATGGTGATGAGGGAATi 
ATATTTAAACACTGCTCCTGTGCGATCAACAATTTCTTATATTGATGGTGATGAGGGAATi 
,,_^_^_^^^^_^_^^^^^^^r^^nT-riT^nnr.ATraarAATTTCTTATAT^^ 



539 
415 
418 
410 
405 



580 



600 



TrCSbl 
T^rCSb2 
TrCSb3 
TrCSb4 
TrCSbS 
TrCSbe 
TrCSb? 



CCTTAGATATAGAGGATACCCCATTGAiGAGTTGGCCGAGAAAAGCACCTTTCCGGAAGT 
CCTTAGATATAGAGGATACCCCATTGAGGAGTTGGCCGAGAAAAGCACCITTTCCGGAAGTi 
CCTTAGATATAGAGGATACCCCATTGAGGAGTTGGCCGAGAAJ^GCACCjTTTCCGGAAGT; 
CCTTAGATATAGAGGATACCCCATTGAGGAGTTGGCCGAGAAAAGCACCTTTCCGGAAGT 
CCTTAGATATAGAGGATACCCCATTGAlGAGTTGGCCGAGAAAAGCACCpTTCCGGAAGT 
ATAGAGG§THcCNATTGAGGAGTTGGaCGAGAAAAGCAC!r,|rTT;A5?GGAAGT 



599 
475 
478 
470 
465 
49 



TrCSbl 
TrCSb2 
TrCSb3 
TrCSb4 
TrCSb5 
TrCSb6 
TrCSb7 



620 



640 



660 



GGCATATCTN 

GGCATATCTCATATTGTATGGAAATTTGCCTTCTGCAAATCAGTTACAAGAATGGGAATT 
GGCATATCTCATATTGTATGGAAATTTGCCTTCTGCAAATCAGTTACAAGAATGGGAATT 
GGCATATCTCATATTGTATGGAAATTTGCCTTCTGC7VAATCAGTTACAAGAATGGGAATT 
GCATATCTCATATTGTATGGAAATTTGCCTTCTGCAAATCAGTTACAApAATGGGAATT 
..Sna^nrpr^T^Wan-nigT^nTATnr.AAf^'TTTaCCT gCTGiAAgTiAGTTAGGTj pAATGGg^^^ 

" ■■ — — GAATGGGAATT 



609 
535 
538 
530 
525 
108 
12 



TrCSbl 
TfrCSb2 
TrCSb3 
TrCSb4 
TrCSb5 
TrCSbe 
TrCSb? 



680 



.700 



720 



TGCTATATCTCAGCATTCAGCCTTACCTCAAGGAGTTTTGGATCTCATACAATg 
TGCTATATCTCAGCATTCAGCCTTACCTCAAGGAGTTTTGGATCTC^ 

TGCTATATCTCAGCATTCAGCCTTACCTCAAGGAGTTTTNjj^BBBBBBIH 

TGCTATATCTCAGCATTCAGCCTTACCTCAAGGAGTTTTGGATCTCATACAATCAATGCC 

ggctatatctcagcattcagcsdgtIcc^gaaggagttttggatItcatacaatcaatgcc 



589 
59.4 
570 
585 
168 
70 




740 



760 



780 



TrCSbl 
TrCSb2 
TrCSb3 
TrCSb4 
TrCSbS 
TrCSbe 
TrCSb7 



TCAAGNISI 

,TCAi.GMGCACATCCTATGGGi4GTCCTAGTGi^TGC^^TAAGCGCTCTiTCTGTTTTTCft^ 
Tr-AnnATnrAraTrCTATGGGCGT;G CTiGTffiAATGC ,^STAAGrgGCTrRTGTCTGTTTTTCA 



592 
228 
130 



TrCSbl 
TrCSb2 
TrCSb3 
TrCSb4 
TrCSbS 
TrCSbS 
TrCSb7 



800 



820 



840 



TCCTGACGC'GAATCCTGCTCTgAGAGGTCTTGATiATiTACgACTCAAAG|AAGTGAGAGA 
Tc^T^^^'^'^^'^^'^^^^^'^^^'T^'-'T-ranAGGTCTTGACATC TACAACTCAAAGC AAGTGAGAGA 



288 
190 



TrCSbl 
TrCSb2 
*TrCSb3 
TrCSb4 
TrCSbS 
TrCSbe 
TrCSb7 



860 



880 



900 



CAAACAAATAGCACGGATTATTGGAAAGATiBAiAACAATTGCTGCTGCAETTrgATCTTAG 
CAAACAAATAGlScGGATTATTGGAAAGATAACAACAATTGCTGCTGCg^lTAAT^^ 



348 
250 



TrCSbl 
TrCSb2 
TrCSb3 
TrCSb4 
TrCSbS 
TrCSbe 
TrCSb7 



920 



940 



960 



AATGGCAGGAAGGCCACCTGTGCTTCCATCCAAC'gAACTATCTTACACKGAGA^^ 
AET^nr.^annAAGnr.CACCTGTiScTTCCATCCAACAAACTjTCTTACACAGAGAACTTCCT 



408 
310 



TrCSbl 
TrCSb2 
TrCSbS 
TrCSb4 
TrCSbS 
TrCSbe 
TrCSb7 



980 



1000 



1020 



ATACATGCTTGATTCTgTAGGCAATCGGTCATATAAACCCAACCCTCAGCTAACTCGTGC 
g^^^^n^r^ ,vn.rpr^rT,r->riSinr:r A ATrnnTCATATAAACcS AAlCCTCG-flCTAACTCGTCC 



468 
370 



1040 



1060 



1080 



TrCSbl 
TrCSb2 
TrCSb3 
TrCSb4 
TrCSbS 
TrCSbe 
TrCSb7 



ACTAGACAT&iATCTTCATCCTGCATGCAGAACATGAAATGAATTGCTCTfACATCTGCTGT 
TxnrrHr^Ar^n^nTx^n'TTrA-rrrTaCATGCAGAACATGAAATGAA TTGCTCTACAT^ 



TrCSbS 
TrCSbS 
TrCSb7 



•GCATCAAGCGGCGTiGATGTATAgACTGCTATTGCTGGgdH 

^gcatcaagIggSgtcgatgtatacactgctattgctggacpgtgttggagc 



579 
490 



TrCSbl 
.TrCSb2 
TrCSb3 
TrCSb4 
TrCSbS 
TrCSbe 
TrCSb? 



X160- 



1180 



1200 



tctgtatggacctcttcatggtggagctaatgaggcggtccttaaaatgctgagtgaaat 



550 



TrCSbl 
TrCSb2 
rrrCSb3 
TrCSb4 
TrCSbS 
TrCSb6 
TrCSb7 



1220 



1240 



Tnr.AAOTnTCGATAACATTCCAG AGTTCATTGAAGGTGTTAANN 



.594 



